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" STEM CELL 

The invention relates to a method to modulate the differentiation state of embryonic 
stem cells. 

5 

During mammalian development those cells that form part of the embryo up until the 
formation of the blastocyst are said to be totipotent (e.g. each cell has the 
developmental potential to form a complete embryo and all the cells required to 
support the growth and development of said embryo). During the formation of the 
10 blastocyst, the cells that comprise the inner cell mass are said to be pluripotential 
(e.g. each cell has the developmental potential to form a variety of tissues). 

Embryonic stem cells (ES cells, those with pluripotentiality) may be principally 
derived from two embryonic sources. Cells isolated from the inner cell mass are 

15 termed embryonic stem (ES) cells. In the laboratory mouse, similar cells can be 
derived from the culture of primordial germ cells isolated from the mesenteries or 
genital ridges of days 8.5-12.5 post coition embryos. These would ultimately 
differentiate into germ cells and are referred to as embryonic germ cells (EG cells). 
Each of these types of pluripotential cell has a similar developmental potential with 

20 respect to differentiation into alternate cell types, but possible differences in 
behaviour (eg with respect to imprinting) have led to these cells to be distinguished 
from one another. Hereinafter embryonic stem cells will encompass both these stem 
cell - types. 

25 Typically ES cell cultures have well defined characteristics. These include, but are 
not limited to; maintenance in culture for at least 20 passages when maintained on 
fibroblast feeder layers; produce clusters of cells in culture referred to as embryoid 
bodies; the ability to differentiate into multiple cell types in monolayer culture; and 
express ES cell specific markers. 
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Until very recently, in vitro culture of human ES cells was not possible. The first 
indication that conditions may be determined which could allow the establishment of 
human ES cells in culture is described in W096/22362. The application describes 
5 cell lines and growth conditions which allow the continuous proliferation of primate 
ES cells which exhibit a range of characteristics or markers which are associated with 
stem cells having pluripotent characteristics. 

More recently Thomson et al (1998) have published conditions in which human ES 
10 cells can be established in culture. The above characteristics shown by primate ES 
cells are also shown by the human ES cell lines. In addition the human cell lines 
show high levels of telomerase activity, a characteristic of cells which have the 
ability to divide continuously in culture in an undifferentiated state. Another group 
(Reubinoff et. al., 2000) have also reported the derivation of human ES cells from 
15 human blastocysts. A third group (Shamblott et. al., 1998) have described EG cell 
derivation. 

A feature of ES cells is that, in the presence of fibroblast feeder layers, they retain the 
ability to divide in an undifferentiated state for several generations. If the feeder 

20 layers are removed then the cells differentiate. The differentiation is often to 
neurones or muscle cells but the exact mechanism by which this occurs and its 
control remain unsolved. It would be desirable to have a reliable culture system 
which does not require the presence of fibroblast feeder cells but includes the 
addition of a factor(s) which maintain ES cells in an undifferentiated state. A 

25 prerequisite to the successful exploitation of ES cells in tissue engineering is to 
provide a reliable and defined cell culture system which can be used to control the 
differentiation of ES cells into a selected cell-type. The identification of gene targets 
involved in maintaining ES cells as ES cells and the identification of gene targets 
involved in differentiation will facilitate this objective. 

30 
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We have identified a regulatory pathway involved in the mechanism by which ES 
cells are maintained as ES cells in culture and which also influences the 
differentiation of said cells in culture. The regulatory pathway comprises two 
families of genes referred to as Notch and Writ. 

5 

The Notch gene is a Drosophila prototype for a family of homologues found in 
diverse species, encoding large, single-span, transmembrane receptors (reviewed' in 
-Weinmaster, 199-7). Within the extracellular domain, located distally from the 
transmembrane region, sure found multiple (10-36), tandem arrays of epidermal 

10 growth factor-like repeats (Wharton et al., 1985; Kopezynski et al., 1988). More 
proximally are found 3 cysteine-rich, Lin- 12/Notch repeats and two conserved 
cysteine residues. /The intracellular domain contains, from proximal to distal with 
respect to the transmembrane region, a subtransmembrane region (STR), six ankyrin 
repeats and a region rich in proline, glutamic acid, serine and threonine (PEST). The 

1 5 generic Notch structure is illustrated in Figure 1 . 

Wnt genes encode diffusible, extracellular signalling molecules of around 350-400 
amino acids in length, defined by a characteristic pattern of conserved cysteine 
residues, along with other . invariant amino acids (see 
20 http://www.stanford.edu/~m 

In the 1970s, the wingless (wg l ) mutation of Drosophila melanogaster was 
described, in which affected individuals showed aberrant wing and haltere 
development (Sharma, 1973; Sharma and Chopra, 1976). When the gene disrupted by 

25 this mutation was subsequently identified, the predicted 468aa peptide sequence 
exhibited remarkable similarity to that of a murine gene, int-1 (Cabrera et al., 1987; 
.Rijsewijk et al., 1987), including an identical pattern of 23 conserved cysteine 
residues, int-1 had earlier been identified as a common integration site of the murine 
mammary tumour virus, and a likely cellular oncogene (Nusse and Vaimus, 1982; 

30 van Ooyen and Nusse, 1984). Thus, the two prototypic members of the Wnt gene 
family were described. Since that time, numerous homologues of wingless/int-1 have 
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been identified in divergent organisms, including Caenorhabditis elegans, 
Drosophila melanogaster, Xenopus laevis, chicken, mouse and humans (reviewed in 
Cadigan and Nusse, 1997; Wodarz and Nusse, 1998). Lower organisms appear to 
possess a limited repertoire of Wnt genes in comparison to higher organisms, 
5 presumably reflecting their lesser developmental complexity. Additionally, 
vertebrates appear to express multiple, closely related orthologues of certain Wnts. 
The Wnt family is composed of more than 60 members, with 14 human homologues 
. alone. Well-documented roles exist for Wnt signalling in a variety of developmental 
processes, including cell fate specification and patterning within the central nervous 
10 system. 

Wnt ligands interact with membrane-bound receptors of the frizzled family, leading 
to activation of a cytoplasmic protein, Dishevelled. Dishevelled inhibits Notch 
activation (2) and also inhibits the activity of an Axin-APC-GSK-3b complex, 

15 promoting formation of a bipartite transcriptional activator comprising b-catenin and 
TCF (4). Wnt signalling may be antagonised by extracellular molecules that compete 
for Wnt binding, including frizzled related proteins (FRP), Wnt inhibitory factors 
(WDF), Dickkopf and Cerberus. Expression of Wnt target genes may also be regulated 
by other proteins that bind to and alter the function of TCF. CREB-Binding Protein 

20 (CBP) exhibits a mutually antagonistic binding affinity for TCF with b-catenin and 
converts TCF into a repressor of target genes (8). Additionally, Notch activation may 
induce transcriptional repression by TCF, even in the presence of b-catenin, through 
expression of the TLE class of putative target genes (5,7). 

25 As. a model system to test the involvement of Notch and Wnt genes in the 
differentiation of ES cells we have used embryonal carcinoma cells which are stem 
cells of teratocarcinomas. The stem cells of early embiyos and the stem cells of 
teratocarcinomas have been demonstrated experimentally to be capable of 
substituting for one another in their respective roles. Thus, an embryonic stem cell 

30 introduced to a syngeneic host may give rise to a teratocarcinoma containing all of 
the elements that would be found in a spontaneous tumour of this type (Mintz et al 
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1978). Likewise, embryonal carcinoma cells derived from a spontaneous germ cell 
carcinoma may participate in embryonic development, and generate normal somatic 
tissue following injection into a blastocyst (Brinster 1974; Mintz and Dlmensee 1975; 
Papaioannou et al 1975). This clearly demonstrates that murine EC cells may respond 
5 to developmental cues in an appropriate manner, and that their differentiation may 
provjde information pertinent to normal embryogenesis. Similarly, human EC cells 
may provide an insight into the processes that regulate human development. 

The TERA2 cell line was derived from a lung metastasis of a human teratocarcinoma 
10 in the mid 1970s (Fogh and Trempe, 1975). Morphologically, TERA2 cultures are 
quite divergent from the characteristic EC phenotype and display significant 
heterogeneity, suggesting that these cells undergo spontaneous differentiation 
(Andrews et al., 1980). However, a tumour containing both embryonal carcinoma 
cells and differentiated derivatives was produced following injection of TERA2 into 
15 a nude mouse host (Andrews et al., 1983a; Andrews et al., 1983b; Andrews et al., 
1984). A cell line established from the EC component of this tumour, named 
NTERA2, closely resembled and maintained the characteristic EC phenotype in 
culture and, unlike the parent line, was able to produce teratocarcinoma in nude mice 
with high frequency (Andrews et al., 1983a; Andrews et al., 1983b; Andrews et al., 
20 1984). Additionally, various subclones of NTERA2 exhibit the ability to 
differentiate extensively in vitro following treatment with chemical inducers (eg 
retinoic acid (RA), HMB A ) (Andrews, 1984; Andrews et al., 1986). 

The expression of human Notch homologues were examined in NTERA2 to 
25 determine their involvement in ES cell differentiation. 

We have discovered that members of the Notch gene family, iVctfcW (Genbank 
accession number AF308602), Notch2 (Genbank accession number NM_024408) 
and Notch3 (Genbank accession number NM_000435) are expressed in EC cells and 
30 NTERA2 cells. Notchl expression was detected as a mKNA band of around 7Kb in 
both EC and differentiated cultures of NTERA2. Notch3, like Notchl, was 
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examined in EC cells. A transcript of around 8Kb was readily detected in all samples. 
The endoderm-specific Notch4 (Genbank accession number XMJ)04207) was not. 

AH three Notch homologies expressed by NTERA2 showed altered transcription 
5 during differentiation in response to retinoic acid. In each case, however, these 
changes were modest and expression was evident in both EC and differentiated 
cultures. The role of the Notch pathway in directing EC/ES differentiation may thus 
depend to a greater extent on the level of signalling activation rattier than the 
abundance of the receptors. In order to investigate this possibility, the expression of 

10 candidate ligands for Notch receptors were examined. For example, dlk (Genbank 
accession number U15979) was detected at high levels in EC cultures, but its 
expression was almost extinguished by 3 days following RA treatment. Low levels 
were also observed through 7 and 14 days post-RA. However, by 21 days, dlk was 
up-regulated to the level seen in EC cultures. These profound changes may reflect an 

15 important role for dlk and other DSL ligands in regulating EC/ES differentiation 
through altered Notch signalling activation. This data is suggestive that the Notch 
signalling pathway is involved in regulating EC cell differentiation and, by 
extrapolation, human ES cell differentiation. 

20 A degenerate PCR strategy was used to investigate the possible expression of novel 
Wnt genes in the NTERA2 system. The expression of a single Wnt gene, Wnt-13, 
was detected in NTERA2. Wnt-13 was absent in EC cells, but showed induction and 
subsequent up-regulation following both retinoic acid and HMBA treatment. Both of 
these agents bring about extensive differentiation of NTERA2, accompanied by the 

25 loss of typical human EC surface markers. 

We have examined the expression of components of the Wnt pathway and of 
transcripts corresponding to other proteins known to interact with Wnt signalling in 
NTERA2 cells. These cells are a model system for aspects of human embryogenesis 
30 and differentiate extensively in vitro in response to chemical inducers. Among the 
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cell types produced following retinoic acid treatment are functional, postmitotic, 
CNS neurons (1,6,17). 

The modulation of the Notch and Wnt signalling pathways may facilitate 
5 manipulation of emhryonic stem cell differentiation. The term modulation refers to 
either the maintenance of embryonic stem cells as embryonic stem cells or the 
facilitation of differentiation of embryonic stem cells along defined cell lineages. 

According to an aspect of the invention there is provided a method to modulate the 
10 phenotype of an embryonic stem cell comprising contacting said cell with a ligand 
binding domain of a polypeptide wherein said domain binds its cognate receptor 
expressed by said cell to modulate said phenotype. 

According to a further aspect of the invention there is provided a method to modulate 
15 the differentiation of an embryonic stem cell comprising: 

i) providing a culture of embryonic stem cells; 

ii) providing at least one ligand, or the active binding fragment thereof, capable 
of binding its cognate receptor polypeptide expressed by said embryonic stem 
cell; 

20 iii) forming a culture comprising embryonic stem cells and said ligand; and 
iv) growing said cell culture. 

hi a preferred method of the invention said ligand is encoded by a nucleic acid 
molecule selected from the group consisting of: 
25 i) a nucieic acid molecule as represented in Figure 22; 

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and 
which encodes a ligand capable of binding a Wnt receptor; and 

iii) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (i) and (ii) above. 
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In a preferred method of the invention said ligand is selected from the group 
consisting of: WNT 1; WNT 2 , WNT 3; WNT 4; WNT 5 A; WNT 6; WNT 7A; 
WNT 8B; WNT 10B; WNT 11; WNT 14; WNT 16. . ,'. 

5 In a further preferred method of the invention said ligand is WNT 13 . 

In an alternative preferred method of the invention said ligand is encoded by a 
nucleic acid molecule selected from the group consisting of: 

i) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, or 
10 18. ■ • 

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and which 
encodes a ligand capable of modulating embryonic stem cell differentiation; 
and 

iii) nucleic acid molecules which are degenerate as a result of the genetic code to 
15 the sequences defined in (i) and (ii) above. 

In a further preferred method of the invention said ligand is selected from the group 
represented by the amino acid sequences in Figures 3, 6, 8, 9, 11, 13, 15, 17, 19, or 
polypeptide variants thereof. 

20 

Polypeptide variants are polypeptide sequences having at least 75% identity with the 
polypeptide sequences as herein disclosed, or fragments and functionally equivalent 
polypeptides thereof. In one embodiment, the polypeptides have at least 85% identity, 
more preferably at least 90% identity, even more preferably at least 95% identity, still 
25 more preferably at least 97% identity, and most preferably at least 99% identity with 
the amino acid sequences illustrated herein. 

In a further preferred method of the invention said cells are induced to differentiate 
by the addition of at least one agent selected from the group consisting of: retinoic 
30 acid; HMBA ; bone morphogenetic proteins ; bromodeoxyuridine; lithium; sonic 
hedgehog. 
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Optionally the inducing agent and the ligand are added simultaneously to a culture of 
embryonic stem cells. Alternatively, the ligand is added before addition of said 
inducing agent. 

5 According to a further aspect of the invention there is provided a method for 
modulating the differentiation of embryonic stem cells comprising: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group consisting of: 

a) a nucleic acid molecule as represented in Figures 2, 4, 5,7, 10, 12, 14, 16, 18. 
10 b) a nucleic acid molecule which hybridises to the nucleic acid in (ii) and which 

encodes a ligand capable of modulating embryonic stem cell differentiation; and 
c) nucleic acid molecules which are degenerate as a result of the genetic code to 

the sequences defined in (a) and (b) above. 

ii) forming a culture comprising the cell identified in (i) above with an 
15 • ' , embryonic stem cell; and 

iii) growing said culture under conditions suitable for the maintenance and/or 
differentiation of said embryonic stem cell. 

According to a yet further aspect of the invention there is provided a method for 
20 modulating the differentiation of embryonic stem cells comprising: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group comprising: 

a) a nucleic acid molecule as represented by the sequence in Figure 22; 

b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and 
25 which encodes a ligand capable of binding a Wnt receptor; and 

c) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (a) and (b) above. 

ii) forming a culture comprising a cell as identified in (i) above with an 
embryonic stem cell; and ■ 

30 iii) growing said culture under conditions suitable for the maintenance and/or 
differentiation of embryonic stem cells. 
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In a preferred method of the invention said cell expresses Wnt-13. 

Optionally the cells expressing the ligand(s) are mixed with a culture of 
5 undifferentiated embryonic stem cells. This is followed by addition of the inducing 
agent ( eg retinoic acid; HMBA, bone morphogenetic proteins; bromodeoxyuridine; 
htMum; sonic hedgehog). / 

In a preferred method of the invention said nucleic acid molecule hybridises under 
10 stringent hybridisation conditions to the nucleic acid molecules represented in (a), (b) 
or (c) above. 

Stringent hybridisation or washing conditions are well known in the art. For example, 
nucleic acid hybrids that are stable after washing in O.lxSSC, 0.1% SDS at 60°C. It is 
- 15 well known in the art that optimal hybridisation conditions can be calculated if the 
sequence of the nucleic acid is known. For example, hybridisation conditions can be 
determined by the GC content of the nucleic acid subject to hybridisation. Please see 
Sambrook et al (1989) Molecular Qoning; A Laboratory Approach, A common 
formula for. calculating the stringency conditions required to achieve hybridisation 
20 between nucleic acid molecules of a specified homology is: 

T m = 81.5° C + 16.6 Log [Na*] + 0.41[ % G + C] -0.63 (%formamide) 

25 In a further preferred method of the invention the nucleic acid molecule is genomic 
DNAorcDNA. 

In a preferred method of the invention the nucleic acid molecule encodes a ligand of 
human origin. 

30 

In a further preferred method of the invention said embryonic stem cells are of human 
origin. 

1 rv 
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In a yet further preferred method of tie invention the ceU transfected wi 
acid according to the invention is a mammalian cell. Preferably the cell is selected 
from the following group: a Chinese hamster ovary cell; murine primary fibroblast 
5 cell; human primary fibroblast cell; transformed mouse fibroblast cell-line STO. 

According to a further aspect of the invention there is provided a method for 
inhibiting the differentiation of embryonic stem cells or embryonal carcinoma cells 
comprising: 
10. - ■ ■ 

i) providing at least one polypeptide, or active fragment thereof, wherein said 
polypeptide is an inhibitor of the Wnt signalling pathway; 

ii) forming a culture comprising the polypeptide identified in (i) above with an 
embryonic stem cell; and 

15 iii) growing said culture under conditions suitable for the maintenance of 
embryonic stem cells in an undifferentiated state. 

In a preferred method of the invention said inhibitor of Wnt signalling is selected 
from the group comprising the active binding fragments thereof of the following 
20 polypeptides: frizzled related polypeptides (FRP); Wnt Inhibitory Factors (WIF); 
Dickkopf; Cerebrus. 

In a further preferred method of the invention said inhibitor of Wnt signalling is 
selected from the group comprising the active binding fragments thereof of the 
25 following polypeptides: SFRP1; SFRP4; FRZB; SFRP2; FZD1; FZD2; FZD9; FZD3; 
FZD5; FZD4; FZD6; FZD7; DVL2; DVL3; GSK3B; AXBSfl; APC; TCF1; W-l; 
CER 1; DKK1-4; SARP 2; SARP 3. 

According to a further aspect of the invention there is provided a method for 
30 inhibiting the differentiation of embryonic stem cells or embryonal carcinoma cells 
comprising: 
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i) * providing a cell transfected with a nucleic acid molecule selected from the 

group consisting of: 

a) a nucleic acid molecule encoding a Wnt inhibitory polypeptide; 
5 b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and 

which encodes a polypeptide capable of inhibiting Wnt signalling; and 
c) nucleic acid molecules which are degenerate as a result of the genetic 

code to the sequences defined in (a) and (b) above, 

ii) contacting the cell of (i) above with a culture of embryonic stem cells; and 

10 iii) growing said culture under conditions suitable for the maintenaace of 
embryonic stem cells in an undifferentiated state. 

In a preferred method of the invention said cells express at least one Wnt inhibitory 
polypeptide selected from the group comprising the active binding fragments thereof 

15 of the following polypeptides: frizzled related polypeptides (FRP); Wnt Inhibitory 
Factors (WEF); Dickkopf; Cerebrus. Preferably said cells express at least one Wnt 
inhibitory polypeptide selected from the group comprising the active binding 
fragments thereof of the following polypeptides: SERPI; SFRP4; FRZB; SFRP2; 
FZD1; FZD2; FZD9; FZD3; FZD5; FZD4; FZD6; FZD7; DVL2; DVL3; GSK3B; 

20 AXIN1; APC; TCF1; WIF-1; CER-1; DKK1-4 

In a further preferred method of the invention the nucleic acid molecule is encoded 
by a nucleic acid molecule which hybridises under stringent hybridisation conditions 
to the nucleic acid molecules represented in (a), (b) or (c) above. Preferably said 
25 inhibitors are human. 

According to a further aspect of the invention there is provided a vector comprising 
the nucleic acid molecule according to the invention. Preferably the vector is an 
expression vector adapted for the expression of the polypeptide encoded by said 
30 nucleic acid molecule. 
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Typically said adaptation includes, by example and not by way of limitation, the 
provision of transcription control sequences (promoter sequences) which mediate 
cell/tissue specific expression. These promoter sequences may be cell/tissue specific, 
. inducible or constitutive. 

5 

Promoter is an art recognised term and, for the sake of clarity, includes the following 
features which are provided by example only, and not by way of limitation. Enhancer 
elements are cis acting nucleic acid sequences often found 5 v to the transcription 
initiation site of a gene ( enhancers can also be found 3' to a' gene sequence or even 

10 located in intronic sequences and is therefore position independent). Enhancers 
function to increase the rate of transcription of the gene to which the enhancer is 
linked. Enhancer activity is responsive to trans acting transcription factors 
(polypeptides) which have been shown to bind specifically to enhancer elements. The 
binding/activity of transcription factors (please see Eukaryotic Transcription Factors, 

15 by David S Latchman, Academic Press Ltd, San Diego) is responsive to a number of 
environmental cues which include, by example and not by way of limitation, 
intermediary metabolites (eg glucose, lipids), environmental effectors (eg light, 
heat,). 

20 Promoter elements also include so called TATA box and RNA polymerase initiation 
selection (RIS) sequences which function to select a site of transcription initiation. 
These sequences also bind polypeptides which function, inter alia, to facilitate 
transcription initiation selection by RNA polymerase. 

25 Adaptations also include the provision of selectable markers 
and autonomous replication sequences which both facilitate the maintenance of said 
vector in either the eukaryotic cell or prokaryotic host. Vectors which are maintained 
autonomously are referred to as episomal vectors. Episomal vectors are desirable 
since these molecules can incorporate large DNA fragments (30-50kb DNA). 

30 Episomal vectors of this type are described in WO98/07876. Alternatively, the 
vector is an integrating vector. 
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Adaptations which facilitate the expression of vector encoded genes include the 
provision of transcription tenninatioii/polyadenylation sequences. This also includes 
the provision of internal ribosome entry sites (IRES) which function to maximise 
5 expression of vector encoded genes arranged in bicistronic or multi-cistronic 
expression cassettes. 

These adaptations are well known in the art. There is a significant amount of 
pubhshed literature with respect to expression vector construction and recombinant 
10 DNA techniques in general. Please see, Sambrook et al (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbour Laboratory, Cold Spring Harbour, NY and 
references therein; Marston, F (1987): DNA Cloning Techniques: A Practical 
Approach Vol DI IRL Press, Oxford UK; DNA Cloning: F M Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc.(1994). 

15 

. Conventional methods to introduce DNA or vector DNA into cells are well known in 
the art and typically involve the use of chemical reagents, cationic lipids or physical 
methods. Chemical methods which facilitate the uptake of DNA by cells include the 
use of DEAE -Dextran ( Vaheri and Pagano Science 175: p434) . DEAE-dextran is a 
20 negatively charged cation which associates and introduces the DNA into cells but 
which can result in loss of cell viability. Calcium phosphate is also a commonly used 
chemical agent which when co-precipitated with DNA introduces the DNA into cells 
(Graham etal Virology (1973) 52: p456). • 

25 The use of cationic lipids (eg liposomes, Feigner (1987) Proc.Natl.Acad.Sci USA, 
84:p7413) has become a common method since it does not have the degree of 
toxicity shown by the above described chemical methods. The cationic head of the 
lipid associates with the negatively charged nucleic acid backbone of the DNA to be 
introduced. The Hpid/DNA complex associates with the cell membrane and fuses 

30 with the cell to introduce the associated DNA into the cell. Liposome mediated DNA 
transfer has several advantages over existing methods. For example, cells which are 
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recalcitrant to traditional chemical methods are more easily transfected using 
liposome mediated transfer. 

More recently still, physical methods to introduce DNA have become effective means 
. 5 to reproducibly transfect cells. Direct microinjection is one such method which can 
deliver DNA directly to the nucleus of a cell ( Capecchi (1980) Cell, 22:p479). This 
allows the analysis of single cell transfectants. So called "biolistic" methods 
physically shoot DNA into cells and/or organelles using a particle gun (Neumann 
(1982) EMBO J, 1: p841). Eleetroporation is arguably the most popular method to 
10 transfect DNA. The method involves the use of a high voltage electrical charge to 
momentarily permeabilise cell membranes making them permeable to . 
macromolecular complexes. However physical methods to introduce DNA do result 
in considerable loss of cell viability due to intracellular damage. These methods 
therefore require extensive optimisation and also require expensive equipment. 

15 

More recently still a method tOThed iimtmopofatioh has become a recognised 
techinque for the introduction of nucleic acid into cells, see Bildirici et al, Nature 
405, 769. The technique involves the use of beads coated with an antibody to a 
specific receptor. The transfection mixture includes nucleic acid, typically vector 

20 DNA, antibody coated beads and cells expressing a specific cell surface receptor. The 
coated beads bind the cell surface receptor and when a shear force is applied to the 
cells the beads are stripped from the cell surface. During bead removal a transient 
hole is created through which nucleic acid and/or other biological molecules, eg 
polypeptides, can enter. Transfection efficiency of between 40-50% is achievable 

25 depending on the nucleic acid used. 

Other non-liposome based, chemical transfectant agents have become available, for 
example ExGen500 (polye%lenimine), produced by MBI Fermentas. ExGen500 is 
particularly effective for transfection of human ES cells (Eiges, 2001). 

30 
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According to a further aspect of the invention there is provided a method for the 
production of the polypeptide encoded by the nucleic acid molecule according to the 
invention comprising: 

i) providing a cell transformed/transfected with a nucleic acid molecule 
5 according to the invention; 

• ii) growing said cell in conditions conducive to the manufacture of said 
polypeptide; and 

i) purifying said polypeptide from said cell, or its growth environment 

In a preferred method of the invention said nucleic acid molecule is the vector 
10 according to the invention. 

In a further preferred method of the invention said vector encodes, and thus said 
recombinant polypeptide is provided with, a secretion signal to facilitate purification 
of said polypeptide. 

15 

According to a further aspect of the invention there are provided host cells which 
have been transformed/transfected with the vector according to the invention, so as to 
include at least part of the polypeptide according to the invention, so as to permit 
expression of at least the functional part of the polypeptide encoded by said nucleic 
20 acid molecule. 

Ideally said host cells are eukaryotic cells, for example, insect cells such as cells from 
a species Spodoptera frugiperda using the baculovirus expression system. 

25 According to a further aspect of the invention there is provided a therapeutic cell 
composition comprising differentiated or differentiating embryonic stem cells 
derived by the method according to the invention. Preferably said composition is for 
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use in the treatment of: Parkinson's disease; Huntington's disease; motor neurone 
disease; heart disease; diabetes; liver disease (eg cirrhosis); renal disease; AIDS. 

According to a further aspect of the invention there is provided a method of treatment 
5 of an animal comprising administering a cell composition comprising embryonic 
stem cells which have been induced to differentiate into at least one cell-type. 

According to a yet further aspect of the invention there is provided condition medium 
obtained by culturing embryonic stem cells according to any of the methods 
10 hereindisclosed. 

An embodiment of the invention will know be described by example only and with 
reference to the following figures: 

15 Figure 1 is a schematic represenation of conserved domains in Notch polypeptides; 

Figure 2 is the nucleic acid sequence of murine notch ligand delta-like 1; 

Figure 3 is the amino acid sequence of murine notch ligand delta-like 1; 

20 

Figure 4 is the nucleic acid sequence of murine notch ligand jagged 1 ; 

Figure 5 is the nucleic acid sequence of human notch ligand jagged 1 (alagille 
syndrome) (JAG1); 

25 

Figure 6 is the amino acid sequence of human notch ligand jagged 1 (alagille 
syndrome); 

Figure 7 is the nucleic acid sequence of human notch ligand jagged 2 (JAG2) 

30 

Figure 8 is the amino acid sequence of human notch ligand jagged 2 (JAG2); 
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Figure ? is the amino acid sequence of murine notch ligand j agged 1; 
Figure 10 is the nucleic acid sequence of murine notch ligand jagged 2; 

5 

Figure 1 1 is the amino acid sequence of murine notch ligand jagged 2; 

Figure 12 is the nucleic acid sequence of human notch ligand delta-like 3 (DLL3); 

10 Figure 13 is the amino acid sequence of human notch ligand delta-like 3 precursor 
. polypeptide; 

Figure 14 is the nucleic acid sequence of human notch ligand delta-1 (DLL1); 
15 Figure 15 is the amino acid sequence of murine notch ligand delta- like 1 ; 

Figure 16 is the nucleic acid sequence of human notch ligand delta-like 4 (DLL4); 
Figure 17 is the amino acid sequence of human notch ligand delta-like 4 (DLL4); 

20 

Figure 1 8 is the nucleic acid sequence of murine notch ligand delta-like 4(DLL4); 

Figure 19 is the amino acid sequence of murine notch ligand delta-like 4(DLL4); 

25 Figure 20 is a western blot of cell extracts of various EC cell-lines probed with 
Notch 2 antisera; 

Figure 21 represents northern blot analysis of the expression patterns of notch genes 
(Notch 1,2,3) and notch ligands (Dlk, jagged 1) in EC cells and EC cells treated with 
30 retinoic acid (RA); 
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Figure 22 represents the nucleic acid sequence of human FFh* 73; 

Figure 23 is a diagramatic representation of the Wnt signalling pathway; 

5 Figure 24 represents northern blot analysis of Wnt 13 and mRNA's corresponding to 
Frizzled receptors and Frizzled related protein antagonists of Wnt signalling in 
NTERA 2 cells various Wnt inhibitors after exposure of NTERA 2 cells; 

Figure 25 represents a northern blot analysis of intracellular components of Wnt 
10 signalling pathway in NTERA 2 cells; 

Figure 26 represents the nucleic acid sequence of human dickkopfl; 

Figure 27 represents the nucleic acid sequence of human dickkopf2\ 

15 

Figure 28 represents the nucleic acid sequence of human dickkopf3\ and 
Figure 29 represents the nucleic acid sequence of human dickkopf4; 
20 Figure 30 represents the nucleic acid sequence of WNT-1 ; 
Figure 31 represents the amino acid sequence of WNTM; 
Figure 32 represents the nucleic acid sequence of WNT-2; 

25 

Figure 33 represents the amino acid sequence of WNT-2; 
Figure 34 represents the nucleic acid sequence of WNT 2B; 
30 Figure 35 represents the amino acid sequence of WNT 2B; 
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Figure 36 represents the nucleic acid sequence of WNT 3; 
Figure 37 represents the amino acid sequence of WNT 3; 
5 Figure 38 represents the nucleic acid sequence of WNT 4; 
Figure 39 represents the amino acid sequence of WNT 4; 
Figure 40 represents the nucleic acid sequence of WNT 5 A; 

10 • 

Figure 41 represents the amino acid sequence of WNT 5A; 
Figure 42 represents the nucleic acid sequence of WNT 6; 
15 Figure 43 represents the amino acid sequence of WNT 6; 
Figure 44 represents the nucleic acid sequence of WNT 7 A; 
Figure 45 represents the amino acid sequence of WNT 7 A; 

20 

Figure 46 represents the amino acid sequence of WNT 7B; 
Figure 47 represents the nucleic acid sequence of WNT 8B; 
25 Figure 48 represents the amino acid sequence of WNT 8B; 
Figure 49 represents the nucleic acid sequence of WNT 10B; 
Figure 50 represents the amino acid sequence of WNT 10B; 

30 

Figure 51 represents the nucleic acid sequence of WNT 11; 
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Figure 52 represents the amino acid sequence of WNT 11; 

Figure 53 represents the nucleic acid sequence of WNT 14 
5 

Figure 54 represents the amino acid sequence of WNT 14; 
Figure 55 represents the nucleic acid sequence of WNT 16; 
10 Figure 56 represents the amino acid sequence of WNT 16; 
Figure 57 represents the nucleic acid sequence of FZD 1 ; 
Figure 58 represents the amino acid sequence of FZD 1; 

15 

Figure 59 represents the nucleic acid sequence of FZD 2; 
Figure 60 represents the amino acid sequence of FZD 2; 
20 Figure 61 represents the nucleic acid sequence of FZE 3; 
Figure 62 represents the amino acid sequence of FZE 3 ; 
Figure 63 represents the nucleic acid sequence of FZD 4; 

25 

Figure 64 represents the amino acid sequence of FZD 4; 
Figure 65 represents the nucleic acid sequence of FZD 5; 
30 Figure 66 represents the amino acid sequence of FZD 5; 

21 
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Figure 67 represents the nucleic acid sequence of FZD 6; 

Figure 68 represents the amino acid sequence of FZD 6; 
5 Figure 69 represents the nucleic acid sequence of FZD 7; 

Figure 70 represents the amino acid sequence of FZD 7; 

Figure 71 represents the nucleic acid sequence of FZD 8; 

10 

Figure 72 represents the amino axad sequence of FZD 8; 
Figure 73 represents the nucleic acid sequence of FZD 9; 
15 Figure 74 represents the amino acid sequence of FZD 9; 
Figure 75 represents the nucleic acid sequence of FZD 10; 
Figure 76 represents the amino acid sequence of FZD 10; 

20 

Figure 77 represents the nucleic acid sequence of FRP; 

Figure 78 represents the amino acid sequence of FRP; 

25 Figure 79 represents the nucleic acid sequence of SARP 1; 

Figure 80 represents the amino acid sequence of SARP 1; 
Figure 81 represents the nucleic acid sequence of SARP 2; 

30 Figure 82 represents the amino acid sequence of SARP 2; 
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Figure 83 represents the nucleic acid sequence of FRZB; 
Figure 84 represents the amino acid sequence of FRZB ; 
5 Figure 85 represents the nucleic acid sequence of FRPHE; 
Figure 86 represents the amino acid sequence of FRPHE; 
Figure 87 represents the nucleic acid sequence of SARP 3; 

10 

Figure 88 represents the amino acid sequence of SARP 3; 
Figure 89 represents the nucleic acid sequence of CER 1; 

i 

15 Figure 90 represents the amino acid sequence of CER 1; 
Figure 91 represents the nucleic acid sequence of DKK1 ; 
Figure 92 represents the amino acid sequence of DKK1; 

20 

Figure 93 represents the nucleic acid sequence of DKK 2; 

Figure 94 represents the amino acid sequence of DKK 2; 

25 Figure 95 represents the nucleic acid sequence of DKK 3; 

Figure 96 represents the amino acid sequence of DKK 3; 
Figure 97 represents the nucleic acid sequence of DKK 4; 

30 Figure 98 represents the amino acid sequence of DKK 4; 
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Figure 99 represents the nucleic acid sequence of WEF-l; 
Figure 100 represents the amino acid sequence of WIF-1; 
5 Figure 101 represents the nucleic acid sequence of SEEP' 1; 
Figure 102 represents the amino acid sequence of SRFP 1; 
Figure 103 represents the nucleic acid sequence of SRFP 4; 

10- . 

Figure 104 represents the amino acid sequence of SRFP 4; and 
Figure 1 05 represents a diagram depicting the pCMV-tracer vector. 

15 Materials and Methods 

Table 1 Cell lines derived from germ cell tumours. 



Cell Line 


Biopsy Site 


Biopsy Histology 


Xenograph 
Histology 


Reference 


2102Ep 


Testis 


EC.T.Y 


" : EC ; 


(Andrews et ah, 
1980) 


833KE 


Testis 


EC,T,C,S 


EC 


(Andrews et al> 
1980) 


TERA-1 


Lung 


EC,T 




(Fogh and 
Trernpe, 1975) 


NTERA2cl. Dl 


Lung 


EC, T 


EC,T 


(Fogh and 
Trempe, 1975) 
(Andrews, 1984) 



Abbreviations used: EC, embryonal carcinoma, T, teratoma, S, seminoma, C, 
choriocarcinoma, Y, yolk-sac carcinoma 
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Cell Lines derived from gestational choriocarcinomas. 

BEWO Coiresponds to gestational choriocarcinoma (Pattillo and Gay, 

1968) 



5 List of Antibodies Used 



Antibody 


Reference 


References 


I SSEA-3 


Andrews et al., 1982 


12 


SSEA-4 


Kannagj et. al., 1983 


18 


Tra-1-60 


Andrews et. al., 1984 


25 


Tra-1-81 


Andrews et. al., 1984 


25 


Tra-2-54 


Andrews et. al., 1984 


20 


Tra-2-49 


Andrews et. al., 1984 


20 


A2B5 


Fenderson et al., 1 987 




ME311 


Fendersonet. al., 1987 




Vin-is-56 


Andrews et. al., 1990 


44 


Vin-is-53 


Andrews et. al., 1990 


. 44 


Vin-2PB-22 


Andrews et. al., 1990 


44 


Thy-1 


Andrews et. al., 1983 


10 



Expression Vectors 

10 The following mammlian expression vectors are used in the expression of ligands 
hereindisclosed: 

Purchased from Stratagene Inc. pExchange-1; pExchange-2; pExchange-3A, 3B, 3C; 
pExchange-4A, 4B, 4C; pExchange-5A, 5b,5C; pExchange-6A, 6B, 6C; pExchange 
module EC-hyg; pExchange module EC-Puro; pExchange module EC-Neo; pCMV- 
15 Script; pCMV-Tagl; pCMV-Tag2; pCMV-Tag3; pCMV-Tag4; pCMV-Tag5; 
pCMVLACI, pOPRSWMCS, pOPD-CAT ; pERV3; pEGSH. 

Purchased from Invitrogen Inv. 
T-REX System vectors 
20 pcDNA4/TO; pcDNA4/TO/myc-His; pcDNA6/TR; pT-Rex-DEST30; pT-Rex- 
DEST31; pcDNA4/TO-E; pcDNA5/FRT/TO; pcDNA5/FRT/TO-TOPO. 
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Geneswitch System vectors 
pGene/V5-His A, B, C; pSwitch 

5 Ecdvsone-Tnducible System 

PVgRXR; pIND; pMD(SPl); pINDA^5-His; pINDAT5-ffis-TOPO; pIND/GFP; 
pIND(SPl)/GFP. 

10 PShooter vectors 

pRF/Myc/Nuc; pCMVTMyc/nuc; pEF/myc/mito; pCMV/myc/mito; pEF/myc/ER; 
pCMV/myc/ER; pEF/myc/cyto; pCMV/myc/cyto. 



15 Tnvitrogen INC 

pTet-off; pTet-on; ptTA-2/ /3 /4; pTet-tTS; pTRE2hyg 
PTRE2pur; pTRE2; pLP-TRE2; PTRE-Myc; pTRE-HA; pTRE-6xHN 
pTRE-d2EGFP; pBI; pBI-EGFP; pBI-G; pBI-L,-pTK-Hyg 

20 

"Livine colours" vectors. 

pDsRed2-Nl; pDsRed2-Cl; pECFP-Nl; pEGFP-Nl; pEGFP-N2; pEGFP-N3 
pEYFP-Nl; pECFP-Cl ; pEGFP-Cl; pEGFP-C2; pEGFP-C3 
25 pEYFP-Cl; pdlEGFP-Nl; pdlECFP-Nl; pd2EGFP-Nl; pd2EYFP-Nl 
pd4EGFP-Nl; pCMS-EGFP; pHygEGFP; pEGFPLuc; pNF-KB-dsEGFP 
pIRES2-EGFP; pIRES-EYFP 

Maintenance of cell lines 

30 

All cells were grown in Dulbecco's modified Eagle's medium (DMEM), 
supplemented with 10% by volume foetal calf serum (Gibco BRL) and 2mM 1^ 
glutamine. Tissue culture flasks were incubated in a humidified atmosphere of 10% 
C0 2 in air at 37°C. 

35 
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Treatment of NTERA2 Cells 
Retinoic acid 

5 Medium was aspirated from confluent flasks of EC cells and the cells rinsed in sterile 
PBS. 1ml of 0.25% (w/v) trypsin in 2mM EDTA was added per 75cm 2 flask and the 
flask incubated at room temperature for up to 5 minutes. Vigorous shaking was 
subsequently used to dislodge the cells. Cells were suspended in 9ml of 
supplemented DMEM per ml of trypsin used and counted in a haemocytometer. Cells 

10 were seeded at 10 6 cells per 75cm 2 flask, in medium containing 10 _5 M aU-trans- 
retinoic acid (Eastman Kodak), diluted from a 10" 2 M stock solution in dimethyl 
sulfoxide (DMSO). Flasks were incubated as described above and the media replaced 
as and when required. 

15 Hexamethvlene bisacetamide (HMBA) 

Cells to be treated with HMBA were prepared as described ibr retinoic acid, but 
grown in medium supplemented with 1 0* 3 M HMBA instead of RA. 

Harvesting of cells 

20 Cells were dislodged from the culture vessel with trypsin and suspended in 9ml 
culture medium per ml of trypsin solution used, as described above. The cell 
suspension was then centrifuged at 400 x g for 3 minutes and the medium aspirated 
from the resulting cell pellet Cells were then rinsed in 5ml PBS and centrifuged 
again at 400 x g for 1 minute. The PBS rinse was aspirated- and the cells stored at - 

25 80°C or used immediately. 

Total RNA preparation 

Where possible, all vessels and all solutions used in RNA preparation and storage 
30 were treated with a 0.01% (w/v) solution of diethylpyrocarbonate (DEPQ in distilled 
water, and subsequently autoclaved. 
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TRI reagent (Sigma) was added to pelleted cells in a quantity corresponding to 1ml 
per 75cm 2 flask. The lysate was agitated until homogenous. 0.2ml of chloroform was 
added per ml of TRI reagent used and the vessel vortexed for 10 seconds. After 10 
minutes at room temperature, the lysate was centrifuged' at 12000 x g for 1 5 minutes 

5 at 4°C. Following centrifugation, the aqueous (uppermost) phase was transferred to a 
fresh vessel and 0.5ml of isopropanol added per ml of TRI reagent used. The sample 
was incubated at room temperature for 10 minutes, then centrifuged at 12000 x g for 
10 minutes at 4°C. Following centrifugation, the supernatant was removed and the 
pellet washed in 70% ethanol. RNA was dissolved in DEPC-treated, double-distilled 

10 water. 

Isolation of mRNA 

lOOmg oligo dT cellulose (Ambion) was suspended in 25ml binding buffer. Up to 
15 2mg of total RNA was then added to the binding buffer and the suspension gently 
agitated at room temperature for 45 minutes. The suspension was then centrifuged at 
3000 x g for 10 minutes and the supernatant discarded. The resulting pellet was re- 
suspended in a further 25ml of binding buffer and agitated at room temperature for 
60 minutes. The suspension was again centrifuged at 3000 x g and the supernatant 
20 discarded. The pellet of oligo dT cellulose was transferred to a spin column using a 
minimal quantity of binding buffer to re-suspend. The column was spun at maximum 
speed in a desktop microfuge for 30 seconds and the eluate discarded. This was 
repeated until the cellulose was dry. 200ul of wash buffer was then added to the 
cellulose and mixed in with a pipette tip. The column was spun at maximum speed 
25 for 1 minute and the eluate discarded. 200ul of DEPC-treated, double-distilled H 2 0 
was then added to the cellulose and mixed in, as before. The column was then spun at 
maximum speed for 2 minutes and the eluted mRNA collected. 

Precipitation of RNA 

30 

To the RNA solution was added O.lx volume of 5M LiCl and 2.5x volume of 100% 
ethanol. After vortexing briefly, the sample was incubated at -20°C for >60 minutes 
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to precipitate. Precipitated RNA was centrifuged at maximum speed in a bench top 
microfuge for 30 minutes. The supernatant was discarded and the pellet rinsed in 
70% ethanol, then dissolved in H2O. 
Quantitation of nucleic acid 

•5 

A Beckman DU 650 spectrophotometer was used for the quantitation of both DNA 
and RNA. The machine was set to measure absorbence at wavelengths of 260nm and 
280nm. After blanking the machine on an appropriate solution, diluted DNA or RNA 
samples in a volume of lOOul were added to the cuvette and measured. The 
10 absorbence at 260nm was used to calculate nucleic acid concentration in \ig/\i\, as 
shown below: 

[Nucleic acid] = (A 260 x N x DF) * 1000 

15 Where N is 33 for single-stranded DNA, 50 for double-stranded DNA and 40 for 
RNA and DF is the dilution factor for the sample added to the cuvette. 

Northern blot analysis 
Blot preparation 

20 1 g of agarose was dissolved in 85ml H 2 0 by boiling. After cooling to around 70°C, 
10ml of lOx MOPS buffer and 5ml of formaldehyde were added, and the gel cast 1- 
5u.g of each mRNA sample was mixed with an appropriate quantity of lOx RNA 
loading buffer to give a final volume of no more than 30ul. The RNA was then 
denatured at 95°C for 2 minutes and quenched on ice for 10 minutes. The gel was 

25 placed in an electrophoresis tank' containing Ix MOPS buffer and the samples loaded 
into each well of the gel, along with appropriate molecular weight markers in the 
outermost wells. 80V were applied across the gel for 2-3 hours or as required. 
Following electrophoresis, the outermost lanes containing the molecular weight 
markers were removed using a scalpel and submerged in double-distilled H2O 

30 containing ethidium bromide at 0.5p.g/ml. The remainder of the gel was submerged in 
>5 volumes of double-distilled H 2 0, which was replaced every 5 minutes for a total 
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of 25 minutes. An appropriately sized piece of GeneScreen Plus (DuPont) membrane, 
just larger than the area of gel to be blotted, was cut. The membrane was hydrated by 
briefly submerging in double-distilled H2O, then transferred to lOx SSC, concurrent 
with fee last 15 minutes of gel washing. The blotting apparatus was assembled as 
5 shown in Figure 2.1, with the gel upside-down, using lOx SSC transfer buffer. After 
transfer of at least 6 hours, the absorbent material was removed from the membrane. 
After marking the position of the wells using a pencil, the membrane was removed 
from the gel and washed briefly in 2x SSC. Whilst still damp, the RNA was fixed to 
the membrane by UV crosslinking. The membrane was then baked at 80°C for 3 
10 hours. 

The' excised marker lanes were de-stained by soaking in a large volume of double- 
distilled H 2 0 for around 3 hours, after which they were visualised on a UV 
transilluminator and photographed. 

15 

Probe preparation 

Random-primed DNA labelling was carried out using the Prime-a-Gene kit from 
Promega. Approximately 25ng of template DNA (PCR or restriction digest product) 
20 was denatured at 95°C for 2 minutes, then quenched on ice for 10 minutes. The 
reaction mix was then assembled on ice, in the order indicated below: 

10|il of 5x labelling buffer 
H 2 0 to give a final volume of 50pl 
25 2^1 unlabelled dNTP mix (0,5mM each) 

25ng of denatured/quenched template DNA 
2|ill0mg/mlBSA 

5jd aP 32 dATP 3000Ci/mmol (NEN DuPont) 
1 (d DNA polymerase 1 large (Klenow) fragment 

30 
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The labelling reaction mix was incubated at room temperature for 2 hours. After this 
period, unincorporated nucleotides were removed using Pharmacia S-300 MicroSpin 
columns. Columns were placed in a microfuge i tube and pre-spun at 735 x g for 1 
minute. The column was then transferred to a fresh tube and the entire labelling 
5 reaction added. The column was then spun at 735 x g for a further 2 minutes and the 
purified, labelled DNA collected. Labelled DNA was denatured at 95°C for 2 
minutes, then quenched on ice for 15 minutes. , 

Hybridisation and washing procedure 

10 . 

Northern blots were equihbrated in 150ml of 2x SSC at 42°C for 15 minutes m a 
hybridisation oven at 8 RPM. The SSC was exchanged for 25ml of hybridisation 
buffer, pre-warmed to 42°C, and the filter incubated for a further 30 minutes at the 
same temperature. The entire volume of purified probe solution was then added to 
15 me hybridisation buffer and the blot incubated overnight at 42°C/ 8 RPM. The 
hybridisation solution was then discarded and the blot washed as follows: 

2x SSC at room temperature for 20 minutes 
2x SSC at room temperature for 20 minutes 
20 2xSSC/l%SDS at 65°C for 45 minutes 

2x SSC/1% SDS at 65°C for 45 minutes 
O.lx SSC at room temperature for 20 minutes 
O.lx SSC at room temperature for 20 minutes 

25 Filters were exposed to a Bio Rad BI phosphor-imager screen overnight and, in most 
cases, subsequently exposed to X-ray film (Kodak X-omat AR). 

Loading controls for Northern blots 

30 All Northern blots used in this study were probed with p-actin as a loading control. 
Table 2.5 (overleaf) lists the figures to which each control probing (panel A to T, 
Figure 2.2) corresponds. Northern blot data presented in this study have not, in all 
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cases, been subject to repeat experiments using RNA isolated from different batches 
of cells. These data may not be regarded as conclusive, since reproducibility has not 
been proven. 

5 Method for Analysis of the Require ment for Notch Ligands in the 
Differentiation of Embryonic Stem. Embryon al Carcinoma and their 
Differentiated Derivatives. 

CHO are transfected with constructs encoding either membrane bound or soluble 
10 forms of the Notch ligands. These cell lines are used to support the growth of either 
Embryonal carcinoma cells (EC) e.g NTERA2/cl.Dl or Human embryonic stem cells 
(hES). 

The transfected CHO cells (CHO(DSL)) are used in the following way. To assess 
15 . membrane bound forms of me Notch ligands the CHO(DSL) cells are used as feeder 
cells (i.e. the EC or hES will be grown on top of the CHO(DSL) cells). To assess 
me soluble forms of the Notch ligands either supernatant from the transfected CHO 
cells or concentrated ligand molecules derived from the supernatant are added to the 
culture medium of the EC and hES cells. 

20 

Notch Ligand Constructs. 

The following cloned Notch ligands were obtained from Dr. Shigeru Chiba, 
Department of Hematology, Oncology and Cell Therapy, Transplantation Medicine. 
25 Graduate School of Medicine. University of Tokyo. 

Deltal-FLAG 

Jaggedl-FLAG 

Jagged2-FLAG 

30 

Soluble Deltal-Fc 
Soluble Jaggedl-Fc 
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Soluble Jagged2-Fc 

These had been cloned into the vector pTRACER-CMV fromlnvitrogen, Fig 30). 

5 The clones used consisted either of the full length ligand linked to a histidine tag 
(FLAG, Kodak Inc.), or a ligand lacking the membrane spanning and intracellular 
portion of the protein thus rendering the ligand soluble. These had been linked to the 
Fc portion of human IgG. 

10 Generation of Notch Ligand expressing Cell lines 

The Chinese Hamster Ovary derived cell line AA8 was maintained in MEM Alpha 
medium with Glutamax-1 supplemented with ribonucleosides and 
deoxyribonucleosides (Lifetechnologjes) and 10% Foetal Bovine Serum 
15 (FBS)(Lifetechnologies). 

Plasmid was transfected into the AA8 cells using either Fugene (Roche) or Lipofectin 
(Lifetechnologjes) or Superfect (Qiagen) according to manufacturers protocols. 

20 Assessment of Transiently Transfected Cell lines for I-igand Production. 

Both soluble and membrane bound forms of the Notch ligand's production are 
assayed by western blotting and chemiluminesent detection. 

25 Cells transfected with the ligand encoding constructs are harvested and the proteins 
extracted. Due to the tagging of the ligands proteins are able to be run out on an 
SDS-PAGE gel, blotted and probed with either mouse anti-FLAG antibody and 
detected using a anti-mouse HRP secondary or an HRP-secondary antibody. Both 
methods use electxo-chermluminecence (ECL) as the detection method. 

30 
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fWentratioh of Soluble Notch ligand fro m the Supernatant of Transfected 
CHO cells. 

Fc-labelled Notch ligand can be purified from transfected CHO cells supernatant 
5 using a HiTrap protein G HP column (Amersham Pharmacia Biotech). A sample can 
be analysed by western blotting as described above. 

Embr yonic Cell culture. 

10 Human Embryonal Carcinoma NTERA2/D1 cells are maintained in Dulbecco's 
modified Eagles medium (DMEM), supplemented wilh 2mM 1-glutamine, 10% 
Foetal Bovine Serum (Ufetechnologies) and at 37°C under 10% C0 2 in air. Cells 
were passaged by scraping from the surface of the tissue culture flask with 3mm glass 
beads and reseeded at 5 x 10 6 cells per 75cm 3 flask. For specific seeding densities 

15 cells were pasaaged using 0.25% Trypsin (Lifetechnologies) in Dulbecco's Phosphate 
Buffered Saline (PBS) supplemented with ImM EDTA. 

Human Embryonic Stem Cells are maintained on irradiated mouse embryonic 
fibroblasts in serum free conditions, with 80% F12:DMEM (Lifetechnologies), 20% 
20 Knockout SR (Lifetechnologies), 1% Non-essential arnino acid solution 
(Lifetechnologies), 1 mM L-glutamine, O.lmM p-mercaptoetanol (Sigma) 4 ng/ml 
bFGF (Sigma). The cells are passaged using collagenase IV and scraping. 
Flow Cvtofluorimetry 

25 Cells were removed from their adherent culture surface and incubated with suitable 
primary antibody for 1 hour at 4C. Cells are washed in PBS with 5% FCS and 
incubated for a further hour with a suitable FITC-conjugated labelled secondary 
antibody, and analysed on a EPICS Elite ESP Flow Cytometer (Coulter Electronics). 
Colonies were assessed for the presence of embryonal stem cell markers such as 

30 SSEA-3, -4, Tra-1-60 and for appearance of markers of differentiated marker 
antigens such as A2B5, ME3 1 1 and N901 . 
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Design of oligonucleotide primers 

Primers for use in PCR were designed on a Macintosh Power PC, using the "Primer 
Selecf ' program of the DNASTAR software package (DNASTAR Inc.). All primers 
5 used in this study are shown in Table 2 



Table 2 List of oligonucleotide primers 



Gene 


GenBank 
accession 


Primer direction 


Prinmer 
location 


Primer sequence 5' to 3' 












Wnt-13 


Z71621 


Forward 


1159-1178 


Tgagtggttcctgtactctg 


Reverse 


1503-1484 


Actcacactgggtaacacgg 


SFRP4 


XMJ)04706 


Forward 


858-880 


Agaggagtggctgcaatgaggtc 


Reverse 


1159-1142 


Gcgcccggctgttttctt 


Waff 


U03106 


Forward 


487-506 


Cagggtcgaaaacggcggca 


Reverse 


947-928 


Aggagccacacccctccaga 


p-actin 


NMJ)01101 


Forward 


326-357 


Atctggcaccacaccttctacaatgagctgc 

fl_ 


Reverse 


1163-1132 


Cgtcatactcctgcttgctgatccacatctgc 


neuroDI 


NMJ)02500 


Forward 


240.-263 


Aagccatgaacgcagaggaggapt 


Reverse 


818-799 


Agctgtccatggtaccgtaa 



All PCR data presented in this study were duplicated in independent experiments to 
10 eliminate the possibility of methodological error. However, duplicate experiments 
were performed on identical samples and do not, therefore, control for variability 
between separate batches of cells. Polymerase chain reactions from which 
quantitative interpretations were to be made were controlled by parallel ampHfication 
of the cyclin-dependent kinase inhibitor, Wafl. This transcript has been demonstrated 
15 by other workers in the laboratory to be constitutively expressed by NTERA2 EC 
cells and differentiated derivatives (unpublished data). Furthermore, Wafl has been 
shown to exhibit an approximately 20-fold lower abundance in the NTERA2 system 
than the more widely used control, p-actin, and is therefore well suited to the analysis 
of rare transcripts. 

20 

PCR Reaction conditions 

PCR mixes were assembled on ice, with the following components per reaction: 

is 
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5u.lof25mMMgCl 2 
5ul of 1 Ox reaction buffer 
5ulof ImM dNTPs 
3ul of forward primer at 5pmol/ul 
5 3u.l of reverse primer at 5pmol/ul 

0.3ul of Taq polymerase at 1 unit/ul (Promega) 
template and H 2 0 to give 50(j1 final volume 

A premix was made containing all reaction components bar the template. Premix was 
10 then added to the reaction vessels containing the template, on ice. The reaction 
vessels were then transferred to the thermal cycler. The PCR programs used are 
shown in Table 3, with' cycling from Tl <3>T2=3>T3 -^Tl . 



15 



Table 3 PCR thermal cycling programs 



Program 1 Program 2 Program 3 Program 4 

XI 96°C/30 seconds 94°C/60 seconds 94°C/90 seconds 95C/90 seconds 
(temp/duration) 

X2 50°C/1 5 seconds 55C/90 seconds 60°C/90 seconds 63°C/60 seconds 
(temp/duration) 

X3 60C/240 seconds 72-C/60 seconds 72-C/120 seconds 72<C/60 seconds 
(temp/duration) 

Cycles 25 35 35 35 



List of DNA and protein accession numbers of genes used in results 



Gene 
Name 


Description 


cDNA Accession Number 


Protein Accession 
Number 


WNT2B 


wingless-type MMTV 
integration site family, 
member 2B 


AB045116 


Q93097 



36 
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member 2B 






SFRP1 


secreted frizzled- 
related protein 1 


AF056087 T 


AAC12877 


SFRP4 


secreted frizzled- 
related protein 4 


AF026692 


AAC04617 


FRZB 


frizzled-related protein 


NM 001463 { 


NP 001454 


SFRP2 


secreted frizzled- 
related protein 2 






FZD1 


frizzled (Drosophila) 
homolog 1 


AB017363 ~~ T 


BAA34666 


FZD2 


frizzled (Drosophila) 
homolog 2 


NM 001466 | 


NP 001457 


FZD9 


frizzled (Drosophila) 
homolog 9 


HSU82169 | 


AAC51174 


FZD3 


frizzled (Drosophila) 
homolog 3 


Kirikoshi et. al., 2000 j 


Kirikoshi et. al., 
2000 


FZD5 


frizzled (Drosophila) 
homolog 5 






FZD4 


frizzled (Drosophila) 
homolog 4 


NM_012193 


NP_036325 


FZD6 


frizzled (Drosophila) 
homolog 6 


AB012911 


BAA25686 


FZD7 


frizzled (Drosophila) 
homolog 7 


AB017365 


BAA34668 


DVT 2 


dishevelled 2 
(homologous to 
Drosophila dsh) 


NM 004422 


NP004413 


DVL3 


dishevelled 3 
(homologous to 
Drosophila dsh) 


NM_004423 


NP_004414 


GSK3B 


glycogen synthase 
kinase 3 beta 


NM_002093 


NP_002084 


AXfNl 


axin 


AF009674 


AAC51624 


APC 


adenomatosis 
polyposis coli 


NM_OO0038 


]s!P_000029 


TCF1 


transcription factor 1, 
hepatic; LF-J31, 
hepatic nuclear factor 
(HNF1), albumin 
proximal factor 


M57732 


1 a a a nonn 

AAA88077 



Examples 

Expression of a single Wnt gene, Wnt-13(2B) was detected. This transcript was 
absent in NTERA2 EC cells, but showed marked up-regulation following RA 
5 treatment, figure 24. Members of the FRP family, encoding putative Wnt antagonists, 
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also showed altered expression during differentiation, figure 24. Both Frp-1 and 
SARP-1 were down-regulated following RA treatment, whilst FrpHE was absent in 
EC cells, but expressed at high levels in RA treated cultures. 

5 Several members of the frizzled family were also detected, providing a candidate 
receptor system for Wnt-13, figure 24. Two of these, hFz-4 and hFz-6, showed 
developmental regulation. Transcripts corresponding to intracellular components of 
the Wnt pathway, including Dishevelled, GSK-3b, Axin, APC and TCF were present 
at equivalent levels in EC and differentiating cultures. CBP was also ubiquitously 

10 expressed. 
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CLAIMS 

1. A method to modulate the differentiation of an embryonic stem cell 

5 comprising: 

i) providing a culture of embryonic stem cells; 

ii) providing at least one ligand, or the active binding fragment thereof; capable 
of binding its cognate receptor polypeptide expressed by said embryonic stem 
cell; 

10 iii) forming a culture comprising embryonic stem cells and said ligand; and 
iv) growing said cell culture. 

2. A method according to Claim 1 wherein said ligand is encoded by a nucleic 
acid molecule selected from the group consisting of: 

15 i) a nucleic acid molecule as represented in Figure 22; 

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and 
which encodes a ligand capable of binding a Wnt receptor; and 

iii) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (i) and (ii) above. 

20 

3 . A method according to Claim 2 wherein said ligand is encoded by a nucleic 
acid molecule selected from the nucleic acid sequences represented in: Fig 30; Fig 
32; Fig 34; Fig 36; Fig 38; Fig 40; Fig 42; Fig 44; Fig 47; Fig 49; Fig 51; Fig 53; Fig 
55. 

25 

4. A method according to Claim 2 or 3 wherein said ligand is encoded by a 
nucleic acid molecule as represented by the nucleic acid sequence in Fig 22. 
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5. A method according to Claim 1 wherein said ligand is encoded by a nucleic 
acid molecule selected from the group consisting of: 

i) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, or 
18. 

5 ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and which 
encodes a ligand capable of modulating embryonic stem cell differentiation; 
and 

iii) nucleic acid molecules which are degenerate as a result of the genetic code to 
the sequences defined in (i) and (ii) above. 

10 

6. A method according to Claim 5 wherein said ligand is selected from the 
group comprising the amino acid sequences in Figures 3, 6, 8, 9, 11, 13, 15, 17, 19, 
or polypeptide variants thereof. 

15 7. A method according to any of Claims 1-6 wherein said cells are induced to 
differentiate by the addition of at least one agent selected from the group consisting 
of: retinoic acid; hexamethylene bisacetamide; bone morphogenetic proteins; 
bromodeoxyuridine; hthium; sonic hedgehog. 

20 8. A method for modulating the differentiation of embryonic stem cells 
comprising: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group consisting of: 

a) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, 18. 
25 b) a nucleic acid molecule which hybridises to the nucleic acid in (ii) and which 
encodes a ligand capable of modulating embryonic stem cell differentiation; 
and 

c) nucleic acid molecules which are degenerate as a result of the genetic code to 
the sequences defined in (a) and (b) above. 
30 ii) forming a culture comprising the cell identified in (i) above with an 
embryonic stem cell; and 
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iii) growing said culture under conditions suitable for the maintenance and/or 
differentiation of said embryonic stem cell. 

9. A method for modulating the differentiation of embryonic stem cells 
5 comprising: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group consisting of: 

a) a nucleic acid molecule as represented by the sequence in Figure 22; 

b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and 
10 which encodes a ligand capableof binding a Wnt receptor; and 

c) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (a) and (b) above. 

ii) forming a culture comprising a cell identified in (i) above with an embryonic 
stem cell; and 

15 iii) growing said culture under conditions suitable for the maintenance and/or 
differentiation of embryonic stem cells. 

10. A method according to Claim 9 wherein said cell expresses Wnt-13 ligand. 

20 11. A method according to any of Claims 9 or 10 wherein said cells are induced 
to differentiate by the addition of at least one agent selected from the group 
consisting of: retinoic acid; hexamethylene bisacetamide; bone morphogenetic 
proteins; bromodeoxyuridine; lithium; sonic hedgehog. 

25 12. A method according to any of Claims 1-11 wherein said nucleic acid 
molecule encodes a ligand of human origin. 

13. A method according to any of Claims 1-12 wherein said embryonic stem cells 
are of human origin. 

30 

14. A method according to any of Claims 8-13 wherein said transfected cell is a 
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mammalian cell. 

15. A cell according to Claim 14 wherein said cell is selected from the group 
consisting of: a Chinese hamster ovary cell; murine primary fibroblast cell; human 

5 primary fibroblast cell; transformed mouse fibroblast cell-line STO. 

16. A method for inhibiting the differentiation of embryonic stem cells 
comprising the steps of: 

i) providing at least one polypeptide, or active fragment thereof, wherein said 
1 0 polypeptide is an inhibitor of the Wnt signalling pathway. 

iii) forming a culture comprising the polypeptide identified in (i) above with an 

embryonic stem cell; and 
iii) growing said culture under conditions suitable for the maintenance of 
embryonic stem cells in an undifferentiated state. 

15 

17. A method according to Claim 16 wherein said inhibitor is selected from the 
group consisting of the active binding fragments thereof of the following 
polypeptides: frizzled related polypeptides (FRP); Wnt Inhibitory Factors (W1F); 
Dickkopf; Cerebrus. 

20 

18. A method according to Claim 1 7 wherein said inhibitor is encoded by a 
nucleic acid molecule selected from the nucleic acid sequences represented by: Fig 
57; Fig 59; Fig 61; Fig 63; Fig 65; Fig 67; Fig 69; Fig 71 ; Fig 73; Fig 75; Fig 77; Fig 
79; Fig 81; Fig 83; Fig 85; Fig 87; Fig 89; Fig 91; Fig 93; Fig 95; Fig 97; Fig 99; Fig 

25 101; or Fig 103. 

19. A method for inhibiting the differentiation of embryonic stem cells 
comprising the steps of: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
30 group consisting of: 

a) a nucleic acid molecule encoding a Wnt inhibitory polypeptide; 
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b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and 
which encodes a polypeptide capable of inhibiting Wnt signalling; and 

c) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (a) and (b) above. 

5 ii) forming a culture of the cell identified in (i) above with an embryonic stem 
cell; and 

iii) growing said culture under conditions suitable for the maintenance of 
embryonic stem cells in an undifferentiated state. 



10 20. A method according to Claim 1 9 wherein said cells express at least one Wnt 
inhibitory polypeptide selected from the group consisting of the active binding 
fragments thereof of the following polypeptides: frizzled related polypeptides (ERP); 
Wnt Inhibitory Factors (WIF); Dickkopf; Cerebrus. 

15 21 . A method according to Claim 19 wherein said cells express at least one Wnt 
inhibitory polypeptide encoded by a nucleic acid molecule selected from the nucleic 
acid sequences represented by : Fig 57; Fig 59; Fig 61; Fig 63; Fig 65; Fig 67; Fig 69; 
Fig 71; Fig 73; Fig 75; Fig 77; Fig 79; Fig 81; Fig 83; Fig 85; Fig 87; Fig 89; Fig 91; 
Fig 93; Fig 95; Fig 97; Fig 99; Fig 101; Fig or 103. 

20 

22. A cell or cell culture obtainable by the method according to any of Claims 1- 
21. 

23. A therapeutic cell composition obtainable by the method according to any of 
25 Claims 1-15. 

24. Use of a cell according to Claim 23 for the manufacture of a composition for 
use in the treatment of a disease selected from the group consisting of: 
Parkinson's disease; Huntington's disease; motor neurone disease; heart disease; 

30 diabetes; liver disease (eg cirrhosis); renal disease; AIDS. 
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25. A method of treatment of an animal, preferably a human, comprising 
adniinistering a cell composition comprising embryonic stem cells which have been 
induced to differentiate into at least one cell-type by the method according to any of 

5 Claims 1-14. 

26. Condition medium obtained by culturing embryonic stem cells according to 
the method of any of Claims 1-21 . 
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Figure 2 

GTCCAGCGGTACCATGGGCCGTCGGAGCGCGCTAGCCCTTGCCGTGGTCTCTGCCCTGC 

?gtgccag5tctggagctccggcgtatttgagctgaa 

GAAGGGGCTGCTG^ 

tg^ggaccttctttcgcgtatg 
aSctg^a^ 

CTGATGGCGCAGGCATCGACCCCGCC 
ScTGGCC^ 

otScaac^^ 

actgtgggaga\gaatggtctcaggaccttcacagtagcggccgcacagacctccggt 

ACTmACCGG^ 

cgacctcggg^tgacgcctttggccacttcacctggggggacagaggggagaagatgt 
gcgaccctggctggaaaggccagtact^ 

tgaccaaca^ 

g^cgctactgcgatgagtg^^ 

Scctcgcagtgtaactgccagga^ 

gaactactgtactcaccataagccgtgcaggaatggagccacctgcacgaacacgggc 

CA^GGAG^ 

GGACAGCTTCTCTTGCACCTGKrCCTCCCGGCTTCTATGGCAAGGTCTGTGAGCTGAGCG 

C^TGACCTGTGCAGATGGCCCTTGCTTCAATGGAGGACGATGTTCAGAT^^ 

GGAGGCTACACCTGCCATTGCCCCTTGGGCTTCTCTGGGTT 

GGATCTCTGGGGCTCTTCCCCTTGTTCTAACGGTGCCAAGTGTGTGGACCT 

rTTACCTGTGCCGGTGCCAGGCTGGCTTCTCCGGGAGGTACTGCGAGGACAATGTGGA 

TGACTGTGCCTCCTGCCCGTGTGCAAATGGGGGCACCTGCCGGGACAGTGTGAACGAC 

Sgtgagca^ 

atgtgtgagtgcggccagggctatg<k:gg 

accaccagggcccatggtggtggacctcagtgagaggcatatggagagccagggcgg 

GCCCTCCOTGG^ 

gctgtggtgctgtggtggtctgcgtccggctgaagctacagaaacaccagcctccacct 

GAACCCTGTGGGGGAGAGACAGAAACCATGAACAACCTAGCCM^ 
AAGGACGTTTCTGTTAGCATCATTGGGGCTACCCAGATCAAGAACACCAACAAGAAGG 

CGGACITTC^ 

TGTGGACTATAACCTCGTTCGAGACCTCAAGGGAGATGAAGCCACGGTCAGGGATACA 
CACAGCAAACGTGACAGCAAGTGCCAGTCACAGAGCTCTGGAGGAGAAGAGAAGATC 

GCCCCA^ 

CTACTTCAAAGGACACCAAGTACCAGTCGGTGTATGTTCTGTCTGCAGAAAAGGATGA 

GTGTGTTATAGCGACTGAGGTGTAAGATGGAAGCGATGTGGCAAAATTCCCATTTCTCT 

CAAATAAAATTCCAAGGATATAGCCCCGATGAATGCTGCTGAGAGAGGAAGGGAGAG 

agcagagcx;gcccgacactgccaggctaggg™^ 

TTGTTCCCATTGCACTATGGACAGTTGCTrTGAAGAGTA^ 
ACITGATTCATATAGGAAGCACGCACTGCCCACACGTCTATC^^ 

AGTCTTTCCrTGAACTAGAAACACAACTGCCTTTATTGTCCTTTTTGATACTGAGATGTG 
ATATTTTTCATGATATCTGTAAAGCTTG 

AAATTTTGGTAAATATGTACAAAGK3CACTTCGGGTCTATGTGACTATATTTTTCT 
ATAAATGTATT^TGGAATATTGTGCAAAT^ 
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Figure3 



TFFR 



GCSWCRPRDDAFGHFTCGDRGEKMCDPGWKGQYCTDPia^GOTDQI^ 

RVGWQGRYCDEORYPGCIilGTCQQ 
CTNTGQGSYTCSCRPGYTGANCEIJW^ 

lsamtcadgpcf^ggrcsdwdggytotcplgfsgfncer™dlcgsspcsngakcvdl 

RCl^CHNGATCHQRGQRYMCECAQ^ 

PFPWVAVCAGVVLXHXLLLGCAAVWCVR 

DVSVSnGATQIRirakKADFHGDHGAKI^ 

RDTKCQSQSSAGEEKIAPTLRGGEfPDRKRPESWSTSKDTKYQSVW 



Figure 4 

cgggcagaggtggaagaggggggacx:gcctcaaagaag^^^ 

AGGCCGGGCTCTTTGCCTTCTGGAACGCGCGGGTCTTG 

GTTGTTTTCCAGTCGTGCATGCTCCAATCCACGGAGTATATTAGAGCCGGGACGCGGCG 
GCCGCGGGGGCAGCGACGACGGCACKJCTCGGCGGGAGCACCAGCGC^ 

gcggcgtccggagtgcccgtggcgcgcggcck:agcgatgcggtccccacggac^gcg 

GCCGGCCCGGGCGCCCCCTGAGTCTTCTGCTCGCCCTGCTCTGTGCCCTGCGAGCCAAG 

GTGTGCGGGGCCTCGGGTCAGTTTGAGCTGGAGATCCTGTCCATGCAGAACGTGAATG 

GAGAGCTACAGAATGGGAACTGTTGTGGTGGAGTCCGGAACCCTGGCGACCGCAAGTG 

CACCCGCGACGAGTGTGATACGTACTTCAAAGTGTGCCTCAAGGAGTATCAGTCCCGC 

GTCACTGCCGGGGGACCCTGCAGCTTCGGCTCAGGGTCT 

ACACCTTCAATCTCAAGGCCAGCCGTGGCAACGACCGTAATCGCATCGTACTGCCTTTC 

AGTTTCGCCTGGCCGAGGTCCTACACTTTGCTGGTGGAGGCCTGGGATTCCAGTAATO 

CACTATTCAACCTGATAGCATAATTGAAAAGGCTTCTCACTCAGGCATGATAAACCCTA 

GCCGGCAATGGCAGACACTGAAACAAAACACAGGGATTGCCCACTTCGAGTATCAGAT 

CCGAGTGACCTGTGATGACCACTACTATGGCTTTGGCTGCAATAAGTTCTGTCGTCCCA 

GAGATGACTTCTTTGGACATTATGCCTGTGACCAGAACGGCAACAAAACTTGCATGGA 

AGGCTGGATGGGTCCTGATTGCAACAAAGCTATCTGCCGACAGGGCTGCAGTCCCAAG 

CATGGGTCTTGTAAACTTCCAGGTGACTGCAGGTGCCAGTACGGTTGGCAGGGCCTGT 

ACTGCGACAAGTGCATCCCGCACCCAGGATGTGTCCACGGCACCTGCAATGAACCCTG 

GCAGTGCCTCTGTGAGACCAACTGGGGTGGACAGCTCTGTGACAAAGATCTGAATTAC 

TGTGGGACTCATCAGCCCTGTCTCAACCGGGGAACATGTAGCAACACTGGGCCTGACA 

AATACCAGTGCTCCTGCCCAGAGGGCTACTCGGGCCCCAACTGTGAAATTGCTGAGCA 

TGCTTGTCTCTCTGACCCCTGCCATAACCGAGGCAGCTGCAAGGAGACCTCCTCAGGCT 

TTGAGTGTGAGTGTTCTCCAGGCTGGACTGGCCCCACGTGTTCCACAAACATCGATGAC 

TGTTCTCCAAATAACTGTTCCCATGGGGGCACCTGCGAGGATCTGGTGAATGGATTCAA 

GTGTGTGTGCCCGCCCCAGTGGACTGGCAAGACTTGTCAGTTAGATGCAAATGAGTGC 

GAGGGCAAACCTTGTGTAAATGCCAGATCCTGTAAGAATCTGATTGCCAGCTACTACTG 

TGATCGCCTTCCTGGCTGGATGGGTCAGAACT^ 

GCCAGTGTCAGAATGACGGCTCCrGTCGGGATTTGGTTAATGGTTATCGCTGTATCTGT 
CCACCTGGCTATGCAGGCGATCACTGTGAGAGAGACATCGATGAGTGTGC^ 
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ACTGGTTTCTCTGGAAACCTCTGTCAGCTGGACATCGATTACTGCGAGCCCAACCCTTG 

CCAGAATGGCGCCCAGTGCTACAATCGTGCCAGTGACTATTTCTGCAAGTGCCCCGAG 

GACTATGAGGGCAAGAACTGCTCACACCTGAAAGACCACTGCCGTACCACCACCTGCG 

AAGTGATTGACAGCTGCACTGTGGCCATGGCCTCCAACGACACGCCTGAAGGGGTGCG 

GTATATCTCTTCTAACGTCTGTGGTCCCCATGGGAAGTGCAAGAGCCAGTCGGGAGGC 

AAATTCACCTGTGACTGTAACAAAGGCTTCACCGGCACCTACTGCCATGAAAATATCA 

ACGAGTGCGAGAGCAACCCCTGTAAAAACGGTGGCACCTGCATCGATGGCGTTAACTC 

CTACAAGTGTATCTGTAGTGACGGCTGGGAGGGAGCGCACTGTGAGAACAACATAAAT 

GACTGTAGCCAGAACCCTTGTCACTACGGGGGTACATGTCGAGACCTGGTCAATGACT 

TTTACTGTGACTGCAAAAATGGCTGGAAAGGAAAGACTTGCCATTCCCGTGACAGCCA 

GTGTGACGAAGCCACGTGTAATAATGGTGGTACCTGCTATGATGAAGTGGACACGTrr 

AAGTGCATGTGTCCCGGTGGCTGGGAAGGAACAACTTGTAATATAGCTAGAAACAGTA 

GCTGCCTGCCGAACCCCTGtCATAATGGAGGTACCTGCGTGGTCAATGGAGACTCCTTC 

ACCTGTGTCTGCAAAGAAGGCTGGGAGGGGCCTATTTGTACTCAAAATACCAACGACT 

GCAGTCCCCATCCTTGTTACAATAGCGGGACCTGTGTGGACGGAGACAACTGGTATCG 

GTGCGAATGTGCCCCGGGTTTTGCTGGGCCAGACTGCAGGATAAACATCAATGAGTGC 

CAGTCTTCCCCTTGTGCCTTTGGGGCCACCTGTGTGGATGAGATCAATGGGTACCAGTG 

TATCTGCCCTCCAGGACATAGTGGTGCCAAGTGCCATGAAGTTTCAGGGCGATCTTGCA 

TCACCATGGGGAGAGTGATACTTGATGGGGCCAAGTGGGATGATGACTGTAACACCTG 

CCAGTGCCTGAATGGACGGGTGGCCTGCTCCAAGGTCTGGTGTGGCCCGAGACCTTGC 

AGGCTGCACAAAAGCCACAATGAGTGCCCCAGTGGGCAGAGCTGCATCCCGGTCCTGG 

ATGACCAGTGTTTCGTGCGCCCCTGCACTGGTGTTGGCGAATGTCGGTCCTCCAGCCTC 

CAGCCAGTGAAGACCAAGTGCACATCTGACTCCTATTACCAGGATAACTGTGCAAACA 

TCACTTTCACCITrAACAAAGAGATGATGTCTCCAGGTCTTACCACCGAACACATTTGC 

AGGGAATTGAGGAATTTGAATATCCTGAAGAATGTTTCTGCTGAATATTCGATCTACAT 

AGCCTGTGAGCCTTCCCTGTCAGCAAACAATGAAATACACGTGGCCATGTCTGCAGAA 

GACATCCGGGATGATGGGAACCCTGTCAAGGAAATTACCGATAAAATAATAGATCTCG 

TTAGTAAACGGGATGGAAACAGCTCACTTATTGCTGCGGTTGCAGAAGTCAGAGTTCA 

GAGGCGTCCTCTGAAAAACAGAACAGATTTCCTGGTTCCTCTGCTGAGCTCTGTCTTAA 

CAGTGGCTTGGGTCTGTTGCTTGGTGACAGCCTTCTACTGGTGTGTACGGAAGCGGCGG 

AAGCCCAGCAGCCACACTCACTCCGCCCCCGAGGACAACACCACCAACAATGTGCGGG 

AGCAGCTGAACCAAATCAAAAACCCCATCGAGAAACACGGAGCCAACACGGTCCCCA 

TTAAGGATTACGAGAACAAAAACTCCAAAATGTCAAAAATCAGGACACACAACTCGG 

AAGTGGAGGAGGATGACATGGATAAACACCAGCAGAAAGTCCGCTTTGCCAAACAGC 

CAGTGTATACGCTGGTAGACAGAGAGGAGAAGGCCCCCAGCGGCACGCCGACAAAAC 

ACCCGAACTGGACAAATAAACAGGACAACAGAGACTTGGAAAGTGCCCAGAGCTTGA 

ACCGGATGGAATACATCGTATAGCAGACAGTGGGCTGCCGCCATAGGTAGAGTTTGAG 

GGCACCGCGGGCCG 



Figure 5 

CTGCGGCCGGCCCGCGAGCTAGGCTGGGTTTTTTTITrTCTCCCCTCCCT 

TCCATGCAGCTGATCTAAAAGGGAATAAAAGGCTGCGCATAATCATAATAATAAAAGA 

AGGGGAGCGCGAGAGAAGGAGAAAGCCGGGAGGTGGAAGAGGAGGGGGAGCGTCTC 

AAAGAAGCGATCAGAATAATAAAAGGAGGCGGCTCTTTGCCTTCTGGAACGGGCCGCT 

CTTGAAAGGGCTTTTGAAAAGTGGTGTTGTTTTCCAGTCGTGCATGCTCCAATCGGCGG 

AGTATATTAGAGCCGGGACGCGGCGGCCGCAGGGGCAGCGGCGACGGCAGCACGGCG 

GCAGCACCAGCGCGAACAGCAGCGGCGGCGTCCCGAGTGCCCGCGGCGCGCGGCGCA 

GCGATGCGTCCCCACGGACGCGCGGCCGGTCCGGGCGCCCCCTAAGCCTCCTGCTCGC 

CCTGCTCTGTGCCCTGCGAGCCAAGGTGTGTGGGGCCTCGGGTCAGTTCGAGTTGGAG 

. a top a n. k a wtp. a a nnnnsx t, rtrrar A o A A C.GOG A ACTGCTGCGGCGGC 
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GCCCGGAACCCGGGAGACCGCAAGTGCACCCGCGACGAGTCT 

gtgcctcaaggagtatcagtccccxct^^ 

AGKKtTCCACGCCTGTCATCGGGGGCAACACCTTCAA^ 

gaccgcaacccatcgtgctgcctttcagtitcgcctggccgaggtcc^ 

GTCGAGGCGTGGGAITCCAGTAATGA^^ 
CTTCTCACTCGGGCATGATCAACC^^^ 

(kkk:gttgcccactitgagtatcagatccgcgtgacctgtcat^^ 

TTGGCTGCAATAAGTTCTGCCGCCCCAGAGATGACTTCTTT^ 
ACCAGAATGGCAACAAAACTTGCATGGAAGGCTGGATGGGCC 
CTATTTGCCGAAAGGCTGCAGTCCTAAGCATGGGTCTTGCAAACTCCCA 
GGTGCCAGTATGGCTGGCAAGGCCTGTACTGTGATAAGTGCATCCCACACCCGGGATG 

cgtccacgck:atctgtaatgagccctggcagtgcctctgtgagaccaactggggcggc 

CAGCTCTGTGACAAAGATCTCAATTACTGTC^ 
GAACTTGTAGCAACACAGGCCCHXjACAAA 

AGGACCCAACTGTGAAATTGCTGAGCACGCCTGCCTCTCTGATCCCTGTCACAACAGA 

GGCAGCTGTAAGGAGACCTCCCTGGGGTTTGAGTGTGAGTGTTCCCCAGGCTGGACCG 

GCCCCACATGCTCTACAAACATTGATGACTGTTCTCCTAATAACTGTTCCCACGGGGG^ 

ACCTGCCAGGACCTGGTTAACGGATTTAAGTGTGTGTGCCCCGCACAGTGG^ 

AAACGTGCCAGTTAGATGCAAATGAATGTGAGGCCAAACCTTGTGTAAACGCCAAATC 

CTGTAAGAATCTCATTGCCAGCTACTACTGCGACTGTCTTCCCGGCTGGATGGGTCAGA 

A^GTCACATAAATATTAATGACTGCCTTGGCCAGTGTCAGAATGACGCCTCCTGTCGG 

GATTTGGTTAATGGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCACTGTGA 

GAGAGACATCGATGAATGTGCCAGCAACCCCTGTTTGAATGGGGGTCACTGTCAGAAT 

GAAATCAACAGATTCCAGTGTCTGTGTCCCACTGGTTTCTCTGGAAACCTCTGTCAGCT 

GGACATCGATTATTGTGAGCCTAATCCCTGCCAGAACGGTGCCCAGTGCTACAACCGT 

GCCAGTGACTATTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACC 

TGAAAGACCACTGCCGCACGACCCCCTGTGAAGTGATTGACAGCTGCACAGTGGO^T 

GGCTTCCAACGACACACCTGAAGGGGTGCGGTATATTTCCTCCAACGTCTGTGGTC 

ACGGGAAGTGCAAGAGTCAGTCGGGAGGCAAATTCACCTGTGACT^^ 

CACGGGAACATACTGCCATGAAAATATTAATGACTGTGAGAGCAACCCTTGTAGAAAC 

GGTGGCACTTGCATCGATGGTGTCAACTCCTACAAGTGCATCTGTAGTGACGGGTGGG 

AGGGGGCCTACTGTGAAACCAATATTAATGACTGCAGCCAGAACCCCTGCCACAATGG 

GGCACGTGTCGCGACCTGGTCAATGACTTCTACTGTGACTGTAAAAATGGGTGGAAAG 

GAAAGACCTGCCACTCACGTGACAGTCAGTGTGATGAGGCCACGTGCAACAACGGTGG 

CACCTGCTATGATGAGGGGGATCTTTTAAGTGCATGTGTCCTGGCGGCTGGGAAGGAA 

CAACCTGTAACATAGCCCGAAACAGTAGCTGCCTGCCCAACCCCTGCCATAATGGGGG 

CACATGTGTGGTCAACGGCGAGTCCTTTACGTGCGTCTGCAAGGAAGGCTGGGAGGGG 

CCCATCTGTGCTCAGAATACCAATGACTGCAGCCCTCATCCCTGTTACAACAGCGGCAC 

CTGTGTGGATGGAGACAACTGGTACCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCC 

GACTGCAGAATAAACATCAATGAATGCCAGTCTTCACCITGTGCCTTTGGAGCGACCTC 

TGTGGATGAGATCAATGGCTACCGGTGTGTCTGCCCTCCAGGGCACAGTGGTGCCAAG 

TGCCAGGAAGTTTCAGGGAGACCTTGCATCACCATGGGGAGTGTGATACCAGATGGGG 

CCAAATGGGATGATGACTGTAATACCTGCCAGTGCCTGAATGGACGATCGCCTGCTCA 

AAGGTCTGGTGTGGCCCTCGACCTTGCCTGCTCCACAAAGGGCACAGCGAGTGCCCCA 

GCGGGCAGAGCTGCATCCCCATCCTGGACGACCAGTGCTTCGTCCACCCCT^ 

GTGGGCGAGTGTCGGTCTTCCAGTCTCCAGCCGGTGAAGACAAAGTGCACCTCTGACT 

CCTATTACCAGGATAACTGTGCGAACATCACATTTACCTTTAA 
CCAGGTCTTACTACGGAGCACATTTGCAGTGAATTGAGG 

TGTTTCCGCTGAATATTCAATCTACATCGCTTGCGAGCCTTCCCCnTCAGCGAAC 

AAATACATGTGGCCATTTCTGCTGAAGATATACGGGATGATGGGAACCCGATCAAG 

GA^ATCACTGACAAAATAATOGATCTTCCTAGTAAAC^ 
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CCTTGTTCCCTrGCTGAG^ 



rrAf^AAGC^^OT^^CCAAGCA^CGGCGTATACGCTGOTAGACA^GAAGAGAA 
^C^CC^^^^C^CG^G^AACACCCAAACTGGACAAACAAACAGGACAACAGJ^ 



ACTGCCGCC^AGGTAG^GG^^™^™ 
tXTTACACTGGCAATGGTAGTTTCTGTG^ 



CGAGTCTGAGGCCGTTGCT 



ir^TATGCAAAAAGCTAGTCAACAGTACCCTGGTTGTGTGTCCCCTTCKAGCCGACAC 
^™^P^V^VQ^TCOTAGGAGCCrGCCCAGCCCCCTGGTCTTTGAGCTCCCACTTC 



A^GTATTTr^^^^^^GGGAGGGGAGACTTTGATGTCAGCAGrrGCTGGTAAAATG 



rrnTTATTAATCACTGTGTATATTrGATTTATTAACTTAATAA^ 



ATCAlTCCTrmATrTATATGTATGTGT™^ 

1 

ATGGAATTITnTrOTACGACGfcTGATGm 



TOj^^WmGAACTCTTCTCATTACTTGTTCCCTATAAGCCA^ 



TAT1 



TATGAACATACAAATGCATCTTTGATGTGTTGnCTTGGCAAlA^ 
rAATAGGCTCT^CTGA^ 



CCCCATCTAGGGGAGCCTTGATTCCCTTGTTATTCAACAGCAAGTGTGAATACTGCT^ 
AATAAACACCACTGGATTAATGGAAAAAAAAAAAAAAAA 



Figure 6 
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NRASDYKXCPEDYEG^ 
GKCKSQSCKjKFTCDCMvGFTC 

YCETNINDCSQNPCHNGGTCRDLVlSODFyCDCKNGWKGKT 

DEGDAFKCMCTGGWEGTTCNIAI^SSCIJP^ 

NTNDCSPHPCYNSGTCVDGDNWYRCECAPGFAGPD^^ 

GYRCVCTPGHSGAKCQEVSGP^CITMGSVlPDGAKWDDDOTCQCI^GmCSKVWCGPR 

PCII.HKGHSECPSGQSCPIII)DQCFVHPCTGVGEC^SSLQPVKTKCTSDSYYQDNCANIT 

FTFNXEMMSPGLTTEfflCSEn^LNILKNVSAEYSIYIACEPSPSAl^ 

PIKEITDiniDLVSiGRDGNSSLIAAV^ 

YWCLRKRRXPGSHTHSASEDNTTNNVREQLNQI^^ 

THNSEVEEDDMDIGHQQKARFAKQPAYTLVDPJBEKPPNGTPTK^ 

SLNRMEYTV 



rigure / 

GGAGCGGGCGCGCGGCGGCGGCGGGGCCGCGGCGGGCGGGTCGCGGGGGCAATGCGG 

GCGCAGGGCCGGGGCCTTCCCCCCGGCGCTGCTGCTGCTGCTGGCGCTCTGGGTGCAG 

GCGGCGCGGCCCATGGGCrATTTCGAGCTGCAGCTGAGCGCGCTGCGGAACGTGAACG 

GGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGCGGCCGGACAACGCGCGCGGGGGG 

CTGCGGCCACGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAGAGTACCAGGCCA 

AGGTGACGCCCACGGGGCCCTGCAGCTACGGCCACGGCGCCACGCCCGTGCTGGGCG 

CAACTCCTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGCCC 

CGGGCCGGCGCGACCAGGACCCGGGCTTCGTCGTCATCCCCTTCCAGTTCGCCTGGCCG 

CGCTCCTTTACCCTCATCGTGGAGGCCTGGGACTGGGACAACGATACCACCCCGAATG 

AGGAGCTGCTGATCGAGCGAGTGTCGCATGCCGCATGATCAACCCGGAGGACCGCTGG 

aagagcctgcacttcAgcggccacgtggcgcacctggagctgcgatccgcgtgcgctg 

CGACGAGAACTACTACAGCGCCACTTGCAACAAGTTCTGCCGGCCCCGCAACGACT 

ttttcggccactacacctgcgaccagtacggcaacaaggcctgcatggacggctggat 

GGGCAAGGAGTGCAAGGAAGCTGTGTGTAAACAAGGGTGTAATTTGCTCCACGGGGG 

ATGCACCGTGCCTGGGGAGTGCAGTGCAGCTACGGCTGGCAAGGGAGGTTCTGCGATG 

AGTGTGTCCCCTACCCCGGCTGCGTGCATGGCAGTTGTGTGGAGCCCTGGCAGTGCAA 

CTGTGAGACCAACTGGGGCGGCCTGCTCTGTGACAAAGACCTGAACTACTGTGGCAGC 

CACCACCCCTGCACCAACGGAGGCACGTGCATCAACGCCGAGCCTGACCAGTACCGCT 

GCACCTGCCCTGACGGCTACTCGGGCAGGAACTGTGAGAAGGCTGAGCACGCCTGCAC 

CTCCAACCCGTGTGCCAACGGGGGCTGTTGCCATGAGGTGCCGTCCGGCTTCGAATGCC 

ACTGCCCATCGGGCTGGAGCGGGCCCACCTGTGCCCTTGACATCGATGAGTGTGCTTCG 

AACCCGTGTGCGGCCGGTGGCACCTGTGTGGACCAGGTGGACGGCTTTGAGTGCATCT 

GCCCCGAGCAGTGGGTGGGGGCCACCTGCCAGCTGGACGTCAACGACTGTGAAGGGA 

AGCCATGCCTTAACGCTTTTTCTTGCAAAAACCTGATTGGCGGCTATTACTGTGATTGC 

ATCCCGGGCTGGAAGGGCATCAACTGCCATATCAACGTCAACGACTGTCGCGGGCAGT 

GTCAGCATGGGGCACCTGCAAGGACCTGGTGAACGGGTACCAGTGTGTGTGCCCACGG 

GGCTTCGGAGGCCGGCATTGCGAGCTGGAACGAGACAAGTGTGCCAGCAGCCCCTGCC 

ACAGCGGCGGCCTCTGCGAGGACCTGGCCGACGGCTCCACTGCCACTGCCCCCAGGGC 

TTCTCCGGGCCTCTCTGTGAGGTGGATGTCGACCTTTGTGAGCCAAGCCCCTGCCGGAA 

CGGCGCTCGCTGCTATAACCTGGAGGGTGACTATTACTGCGCCTGCCCTGATGACTTTG 

GTGGCAAGAACTGCTCCGTGCCCCGCGAGCCGTGCCCTGGCGGGGCCTGCAGAGTGAT 

CGATGGCTGCGGGTCAGACGCGGGGCCTGGGATGCCTGGCACAGCAGCCTCCGGCGTG 

TGTGGCCCCCATGGACGCTGCGTCAGCCAGCCAGGGGGCAACTTTTCCTGCATCTGTGA 

CAGTGGCTITACTGGCACCTACTGCCATGAGAACATTGACGACTGCCTGGGCCAGCCCT 

GCCGCAATGGGGGCACATGCATCGATGAGGTGGACGCCTTCCGCTGCTTCTGCCCCAG 

CGGCTGGGAGGGCGAGCTCTGCGACACCAATCCCAACGACTGCCTTCCCGATCCCTGC 
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CACAGCCGCGGCCGCTGCTACGACCTGGTCAATC 

CAACGGTGGCACCTGCHACGACAGCGGCGACAC^ 
TGGAAGGGCAGCACCTGCGCCGTCGCCAAGAACAGCAGCTGCCTGCCC^ 

TGMTGGTGGCACCTGCGTCK3<X:AC3CG(3GGCCTCC^ 

G^ScCTGAC^GCcS^ 
GGGC^ACGTGTCTGGATCAGA^ 

rr^rcrc^TGCCAGGAAGTGATCGGGTTCGGGAGATCCTGCTG 

ag^cStS 

("vga gcaccccctgcctgcc acgctcggccacctggac aat aagtgtgcccgcc 1 c al.l- 

nATrrGCTCCCTGCCAGCCACAAGGGCTGTGGCACGGGACCGCCTGCrGGTGTTCjClll 
^GACCGGGCGrcCTCG^ 

ATCACCCAG^ 

GG^GTTACGG^GGCTCTTCCACA^ 

Stg^aagaIotcacg^ 

GACTCCCTGGAGGCGGAGAAGTTCTCTCACACAAATTCACCAAAGATCCTGGCCGCTC 

gcSggaggccggccca 
Sggatc^^^ 

rTAGTTTCTTTAITTTGTGTAAAAAAACCACCAAAAACAAAAACCAAATGTTTATm 

ScGmcmAACCTTGTATAAATTATTC^^^ 

ATTCTCGG^ 

CCCTCTOTGOTCOTCT^ 

GTOAC^AAGTGCC^CACACAGACCCCTCGCAACTGTCCACGCGTGCCGTGGCACCAGG 
CGC^GCCC^ 

AAAAAAAAAAATTCCTGCCC 



Figure 8 

mAOGRGAITPAIXLLIALWVQ^ 
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PRAGGDQDPGFVVIPFQFAWPRSFrL^ 
SHIFSGHVAIDJnXJIRVRCDENY^ 

KEOCFAVCKQGCl^IIIGGCTWGECRCSYGWQGRFCDEC^^ 

NWGGLLCDKDLNYCGSHHPCmGGTCINAEPDQYRCTCPDGYSGRNCEKAEHACTSNPC 
ANGGSCHEWSGFECHCTSGWSGPTCAIX>IDECASNPCAAGGTCVDQVDGF^CICPEQWV 

GATCQLDVNDCEGKPCLNAFSCKNU^ 
DLVNGYQCVCPRGFGGRHCELERDKCASSP 

VDIX^SPCPmGARC^EGDYYCACPDDFGGKNCSWREPCPGGACRVIDGGGSDAGPG 

MPGTAASGVCGPHGRCVSQPGGNFSCK?DSGFTGTYCH^^ 

FRCFCTSGWEGEIX33T]^NDCLPDP 

DAYTCSNGGTCmSGDTFRCACPPGWKGSTCAVAKNSSCIJl^CVNGGTCVGSGASFSCI 
CPJ)GWEGRTCTH>nT^©C3SIPLPCYNGGICVDGVNWFRCECAP 

AYGATCWEINGYRCSCPPGRAGPRCQEVIGFGRSCWSRGTPFPHGSSWVEDCNSCRC1LDG 
RRDCSKWCGWKPCLLAGQPEALSAQCPLGQRCLEKAPGQCLRPPCEAWGECGAEEPPST 
PCLPRS GHLDNNC APXTLHFNRDHVPQGTTVGAICS GIRSLP ATRAV ARDR1LVLLCDRAS 
SGASAVEVAVSFSPARDIJPDSSLIQGAAHAWAAITQRGNSSIXLAVTEVKVETW 

GLLWVLCGAFSVLWLACVVIGVWWTRKR^ 

PGGHKDVLYQCKMTEPPPRRADEAIPGPAGHAAVREDEEDEDLGRGEEDSI^AEOI^H^ 
FTKDPGRSPGRPAHWASGPKVDNRAVRSINEARYAGKE 



Figure 9 

MRSPRTRGRPGRPI^LLLALLCALRAKVCGASGQFELEIl^MQNVNGELQNGNCCGG^ 

PGDRKCTRDECDTYFKVCLKEYQSRVTAGGPCSFGSGSIPVIGGNTFNLKASRGI^RNP^ 

LPFSFAWPRSYTlXVEAWDSSNDTIQPDSIffiKASHSGMWSRQWQTLKQNTGIAHITBYQIR 

WCDDHY^GFGOSIKFCRPPJ^DFFGHYACDQNGNKTCMEGWMGPDCNK^ 

GSCKLPGDCRCQYGWQGLYC3)KCIPHPGCVHGTCHB 

GTHQPCLl^GTCSNTGPDKYQCSCTEGYSGPNCEIAEHACXSDPCIWRGSCKETSSGFECE 

CSPGWTGPTCSTNIDDCSP1WCSHGGTCQDLWGFKCVCTPQWTGKTCQLDANECEAKPC 

VNARSCKM.IASYYCX>CLPGWMGQNCDIN1ND 

DHCEPJDlDECASNPCLNGGHCQNEINRFQCIXPTGFSGNLCQLDroYCEPNPCQNGA(^YN 

RASDYFCKCPEDYEGKNCSin.KDHCRTTTCEVIDSCTVAMASNDTPEGVRYISSNVCGPHG 

KCKSQSGGKFTCDCNKGFTGTYOIENESlDCESNPCXNGGTCroGW 

CENNINDCSQNPCHYGGTCRDLVNDFYCT^ 

DEVDTFKCMCPGGWEGTTCMARNSSCLPM'C^ 

NTbTOCSPHPCYNSGTCVDGDNWYRCECAPGFAG^ 

QCICPPGHSGAKCHEVSGRSCnMGRVIIJDGAKWDDDCNTCQCLNGRVACSKVWCGPRPC 

RLHKSHNECPSGQSCIPVLDDQCFVP^CTGVGECRSSSLQPVKTKCTSDSYYQDNCAMTFT 

FNKEMMSPGLTTEmCSELRNLl^KNVSAEYSIYIACEPSI^AN^ 

VKEITDKnDLVSKRDGNSSLIAAVAEVRVQRRPLKNRTDFLWLI^SVLTV 

YWCVPJCRRKPSSHTHSAPEDNTTNNV^ 

THNSEVEEDDMDKHQQKVRFAKQPVY^ 

SLNRMEYIV 



Figure 10 

TCGAGGCGGCGATGCGGGCACGCGGCTGGGGACGCCTGCCTCGGCGGCTGCTGCTGCT 
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GCGCTGCGGAACGTGAACGW^^^ 

jGGGGGCTGCGC 
:CAGGCCAAGGT< 



aaaSicCKCCGTGiSG^AACTCCnCTACCTGCCGCCGQ^^OGG 



TGCCTTAAGGAGTACCA 



ATTCCCTTTCAG^CTG^CgSgTT^ 
GGACAATGACAC 



OTCTCKCOKCCCGCAACGACTTC^^ 

gScctga^caatacc^^ 

™a£atoa?gIg^^ 
^?£ac^gotcgact^^ 

(^CGCCAATOA^TGTGAA^CGAAGCCGTGCCTTAATGCTITITCTrG 
TG^GGC^A^STGrG^rcCCTCCCGGGCTGGAAGGGCATCAACTGCCAAATCAAC 

TTCCGCTGCCACTCCCcTcG^^ 



gctaccagtgtgtgtgcccgcggggctttggaggtc^ 

CAAGTGTGCCAGCAGGCCCTGCCGCCGGGGTGGCATCTGCGAGGACCIX^ 



AC^CC^CTGCTCCCGGG^ 
^CAGCTOCCGCTGCCTGGATGGCCACC^ 

n^A^A^^AA^ 

2^^2y^Q^2^^^CCGACTCACACTGCGCTTCAACCGTGATCAAGTGC(^ 
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CAGAGCACAGXXCACGGCATCGTCKXTGCTATCACTC 

TGCTGGCTGTCACCGAGGTCAAGGTGGAAACAGTTGTTATGGGTGGCTCTrcCA^GGT 

CTGTTGGTGCCCGTGCTGTGCAGCGTGTTCAGTGTGCTGTGGCTCGCCTGTGTGG^AT 

CTGCGTATGGTGGACACGAAAGCGCAGGAAAGAACGTGAGAGGAGCCGGC^CCACG 

GGATGAGAGCACCAACAACCAGTGGGCCCCGCTCAATCCCATCCGC^^ 
CGGCCAGK3CGGCAGCGGTCTGGGAACTGGGGGCCACAAGGACATACTCTACCAGTC 
AAAAACTTCACACCGCCGCCCCGCAGGGCAGGCGAGGCACTGCCCGGGCCAOTGGCC 
ATGGGGCTGGTGGGGAGGACGAGGAGGATGAAGAGCTGAGCCGTGGAGATGGGGACT 

CCCCAGAGGCAGAGAAGTTCATCTCACACAAGTTCACCAAAGACCC^^ 
CGGAAGGCCAGCCTGCTGGGCTCCAGGGCCCAAAGTGGACAACCGCGCCGTCAGAAG 

TACCAAGGACGTGCGCCGTGCTGGCAGGGAGTAGCCAGCCACCAGGCTGGGACCCAG 
AACCOTCTCTGGCACCACGCTGCCrGCCGGACCAT^^ 

GTTTCTTTATTITGTGTAAAAAACAAAACCAAAACCAAAAAACAAATGTTTATriliTA 
CGTITCTITAACCTTGTATAAATTATTCAACGGCTGTCAGGCGGAAAACAACGGAGTAT 
TCTCGGATCATTGCTATTTTTGTAAAGTTTCCGCGTCCGCACGCACTGTGGCAGGAGAG 

CAGGGCGTGTGTATGTGTGTGTGTGTGTGTCCTCACC 



Figure 11 

MLCDKDLNYCGSHHPCVNGGTCINAEPDQYLCACPDGYLGKNCERAEHACASNPCANGG 

SCHEVPSGFECHCPSGWSGPTCALDIDECASNPCAAGGTCVDQVDGFBCICPEQWVGATC 

OLDAlsffiCEGKPCI^AFSCKNLIGGYYCDCIJGWKGINCQITINDCHGQVSAWGHLQGPVN 

GYOCVCPRGFGVPJICELEYDKCASSPCPJlGGICEDLVDGFRCHCPRGI^GmCEVDMDLC 

EPSPCFNGVRCYNLEGDYYCACPEDFGGKNCSVPRDTCPGGACRVIDGCGFEAGSRARGV 

APSGICGPHGHC^SLPGGOTSCICDSGFTGTYCHENIDDCMGQPCRNGGTGroEVDSFRCFC 

PSGWGELCDINPM)CU>DPCHSRGRCYDLVM)FYCACDDGWKGKTCHSREFQCDAYTC 

SNGGTCYDSGDimCACPPGWKGSTCTlAKNSSCWNPC^GGTCYGSGDSFSCICRDGWE 

GRTCTHNTNDCNPLPCYNGGICVDGVHWFACECAPGF 



Figure 12 

GAAGGCCATGGTCTCCCCACGGATGTCCGGGCTCCTCTCCCAGACTGTGATCCTAGCGC 

TCATTTTCCTCCCCCAGACACGGCCCGCTGGCGTCTTCGAGCTGCAGATCCACTCTTTC 

GGGCCGGGTCCAGGCCCTGGGGCGCCGCGGTCCCCCTGCAGCGCCCGGCTCCCCTGCC 

GCCTCTTCTTCAGAGTCTGCCTGAAGCCTGGGCTCTCAGAGGAGGCCGCCGAGTCCCCG 

TGCGCCCTGGGCGCGGCGCTGAGTGCGCGCGGACCGGTCTACACCGAGCAGCCCGGAG 

CGCCCGGGCCTGATCTCCCACTGCCCGACGGGCTCTTGCAGGTGCCCTTCCGGGACG 

CCTGGCCTGGCACCTTCTCTTTCATCATCGAAACCTGGAGAGAGGAGTTAGGAGACCA 

GATTGGAGGGCCCGCCTGGAGCCTGCTGGCGCGCGTGGCTGGCAGGCGGCGCTTGGCA 

GCCGGAGGCCCGTGGGCCCGGGCATTCAGCGCGCAGGCGCCTGGGAGCTGCGCTTCTC 

GTACCGCGCGCGCTGCGAGCCGCCTGCCGTCGGGACCGCGTGCACGCGCCTCTGCCGT 

CCGCGCAGCGCCCCCTCGCGGTGCGGTCCGGGACTGCGCCCCTGCGCACCGCTCGAGG 

ACGAATGTGAGGCGCCGCTGGTGTGCCGAGCAGGCTGCAGCCCTGAGCATGGCTTCTG 

TGAACAGCCCGGTGAATGCCGATGCCTAGAGGGCTGGACTGGACCCCTCTG(^CGGTC 

CCTGTCTCCACCAGCAGCTGCCTCAGCCCCAGGGGCCCGTCCTCTGCTACCACCGGATG 

CCTTGTCCCTGGGCCTGGGCCCTGTGACGGGAACCCGTGTGCCAATGGAGGCAGCTGT 

AGTGAGACACCCAGGTCCTTTGAATGCACCTGCCCGCGTGGGTTCTACGGGCTGCGGT 
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orTTGTGTGTCGGGGGTGCAGACCCTGACTCTGCCTACATCTGCCACTGCCCACCTGGT 

ATr^GAGTCTGCCTGGACCGGGCCACGCCCTGCGCTGCCGCTGCCGCGCCGGCTTC 

rrGGGTCCTCGCTGCGA^CACGACCT 
GCX3G(^CGTG]^^ 

orrrrrr A^^CGCGAGCGGGCGGACCCGTGCGCCGCGCGCCCCTGTGCTCACGGC 

CGTGGCC(3CGGGCGTGGCC(3(K:GCTGCGCTCTTGCTGGTCCACGT^^ 
rArTrrrAGGATGCTGGGTCTCGCTTGCTGGCTGGGACCCCGGAGCCGTCAGTCCACG 

cIctcccggatgcactcaa^ 
gIaatoaaStagagtctctggaagg 



Figure 13 

TVTVSPRMSGLLSQTVILAL1FLPQTRPAG 

T^PrT^A^SPCAmA^ 
Jlctwwss^prg^ 

ep^p^aScahggrcyahfsglvcacapgym^ t 

PYPSSILSVK 

Figure 14 

A A A rrGGAACGGGGCCCAACTTCTGGGGCCTGGAGAAGGGAAACGAAGTCCCCCCCG 

g^c^gaS 

TATAAAGAACGGCGCCTTTGGGAAGAGGCGGAGACCGGCTTTAAAGAAAGAAGTCTTG 

?ggScta?gatcccccgc^gtccgccgctgto^ 

GCTCTC(3GCCTTGCTGTGTCAGGTCTGGAGCTCTGGGGTGTTCGAACTGAAGCTGCAGG 

G^TGTCCCCCGAG^CGCC^ 
CGAC??OTCAGT^ 

CGOTCCCCTTCGGCTTCACCTCJGCCGGGCACCTTCTCT^ 

ACACAGATTCTCCTGATGACCTC 
G^CACCCA^ 

AGGGCTGCTCCGTTTTCTGCCGTCCC^^ 
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ATrTGCCTGCCTGGATGTGATGAGCAGCATGGATTTTGTGACAAACCAGGGGAATGCA 
TrrrrATGGCACCTGCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGOCjC 

GOT^CACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTAAGAACGGAG 
CAAAATCTGTGAATrGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTC 
CTTCAACTGTGAGAAGAAAATTGACTACTGCAGCTCTTCACCCTGTTCTAATGGTGCCA 

GCACTGTGACGACAACGTGGACGACTGCGCCTCCTCCCCGT 
TGCCGGGATGGCGTGAACGACTTCTCCTGCACCTGCC^ 

ACTGCAGTGCCCCCGTCAGCAGGTGCGAGCACGCACCCTGCCACAATG^GGGCACCTG 

CCACCAGAGGGGCACGGCTATGTGTGCGAATGTGCCCGAAGCTACGGGGGTCCCAACT 

GCCAGTTCCTGCTCCCCGAGCTGCCCCCGGGCCCAGCGGTGGTGGACCTCACTGAGAA 

GCTAGAGGGCCAGGGCGGGCCATTCCCCTGGGTGGCGTGTGCGCCGGGGTCATG 

TCCTCATGCTGCTGCTGGGCTGTGCCGCTGTGGTGGTCTGCGTCCGGCTGAGGCTGCAG 

AAGCACCGGCCCCCAGCCGACCCCTGCCGGGGGGAGACGGAGAC^ 

GCAACTGCCAGCGTGAGAAGGACATCTCAGTCAGCATCATCGGGGC^ 

GAACACCAACAAAAGGCGGACTTCCACGGGGACCACAGCGCCGACAAGAATGGC^C 

AAGGCCCGCTACCCAGCGGTGGACATAACCTCGTGCAGGACCTCAAGGGTGACGACAC 

CGCCGTCAGGGACGCGCACAGCAAGCGTGACACCAGTGCCAGCCCCAGGGCTCCTCAG 

GGGAGGAGAAGGGGACCCCGACCACACTCAGGGGTGGAGAAGCATCGAAAGAAAAA 

GGCCGGACTCGGGCTGTTCAACTTCAAAAGACACCAAGTACCAGTCGGTGTACGTCAT 

ATCCGAGGAGAAGGATGAGTGCGTCATAGCAA.CTGAGGTGTAAAATGGAAGTGAGAT 

GGCAAGACTCCCGTTCTCTTAAAATAAGTAAAATTCCAAGGATATATGCCCCAAC 

TGCTGCTGAAGAGGAGGGAGGCCTCGTGGACTGCTGCTGAGAAACCGAGTTCAGACCG 

AGCAGGTTCTCCTCCTGAGGTCCTCGACGCCTGCCGACAGCCTGTCGCGGCCCGGCCGC 

CTGCGGCACTGCCTTCCGTGACGTCGCCGTTGCACTATGGACAGTTGCTCTTAAGAGAA 

ATTTTGGATTCTTATGAGCCAGTCTTTTCITGAATTAGAAACACAAACACTGC 

GTCCTTTTTGATACGAAGATGTGCT^ 

AT^GTAAAAATATTTTTCATGATATCTC 

TTATAATTTAAATTTTGGTAAATATGT^ 

TTTITGTATATAAATGm^ 
TTTGTTAATGAAGAAATTCCTTTTTAA 

C 

Figure 15 

MGSRCALALAVLSALLCQVWSSGWEIXLQEFVNKKGLLGM^ 

RVCIXHYOASVSPEPPCTYGSAVTPVLGVDSFS1PDGGGADSAKNPIRFPFGFTWPGTFSLI 

IFAnni)SPDDLATENPEmSRLATQRHLTVGEEWSQDLHSSGRTDlXySYRFVCDEH^ 

GEGCSWCRPRDDAFGHFTCGERGEKVCNPGWKGPYCTEPICLPGCDEQHGFCDKPGECK 

CRVGWOGRYCDECmYPGCmGTCQQPWQCNCQEGWGGIJC^QDUTYCTHHKPCKNGA 

TCTOTGOGSYTCSCRPGYTGATCELGIDECDPSPCKNGGSCTDIENSYSCTOT 

ISAMTCADGPCFNGGRCSDSPD^^ 
m^COAGFSGRHCDDNVDDCASSPCANGGTC^ 

?e^aSgatchorghgyvcecarsygg^^ 
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VSHGATOIKNTOKK^FHC^HSADKNGFKARYPAVDYm 
TKCQPQGSSGEEKGTPTTIItGGEAS 



Figure 16 

ATGGCGGCAGCGTCCCGGAGCGCCTCTGGCTGGGCGCTACTGCTGCTGGTC^CACT^ 
GGCAGCAGCGCGCGGCCGGCTCCGGCGTCTTCCAGCTGCAGCTGCAGGAGTTCATCAA 

CGCGTCTGCCTTAAGCACTTCC 

GTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACGACAGTAGCGGCG 
GGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACOT^ 
CATCATCG\AGCTTGGCACGCGCCAGGAGACGACCTGCGGGCAGAGGCCTTGCCACCA 
GATGCACTCATCAGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGT 

T^CTGGATGAGCAAACCAGCACCCTCACAAG 

AGTGACAACTACTATGGAGACAACTGCTCCCGCCTGTGCAAGAAGCGCAATGACCACT 
TCGGCCACTATGTGTGCCAGCCAGATGGCAACTTGTCCTGCCTGCCCGGTTGGACTGGG 
GAATATTGCCAACAGCCTATCTGTCTTTCGGGCTGTCATGAACAGAATGGCTACTGCA 
GCAAGCCAGCA.GAGTGCCTCTGCCGCCCAGGCTGGCAGGGCCGGCTGTGTAACGAATG 

catcccccacaatGgctgtcgccacggcacctgcagcactccctggcaatgtacttgtg 

ATGAGGGCTGGGGAGGCCTGTTTTGTGACCAAGATCTCAACTACTGCACCCACCACTC 

cccatgcaagaatggggcaacgtgctccaacagtgggcagcgaagctacacctgcacc 

TGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGCTCAGCGAGTGTGACAGCA 

ACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCTACCACTGCCTGTG 

TCCTCCGGGCTACTATGGCCTGCATTGTGAACACAGCACCTTGAGCTGCGCCGACTCCC 

CCTGCTTCAATGGGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTTGTGA 

ATGTCCCCCCAACTTCACCGGCTCCAACTGCGAGAAGAAAGTGGACAGGTGCACCAGC 

AACGCCTGTGCCAACGGGGGACAGTGCCTGAACCAGGTCCAAGCCGCATGTGCCGCTG 

CCGTCCTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACTGTGCCCGTAACC 

CTTGCGCCCACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCC 

TGCCGGCTTCTCTGGCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGA 

GTCCCTGCTTCAACAGGGCCACCTGCTACACCGACCTCTCCACAGACACCTITGTGTGC 

AACTGCCCTrATGGCTTTGTGGGCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAG 

CTTCCCCTGGGTGGCCGTCTCGCTGGGTGTGGGGCTGGCAGTGCTGCTGGTACTGCTGG 

GCATGGTGGCAGTGGCTGTGCGGCAGCTGCGGCTTCGACGGCCGGACGACGGCAGCAG 

GGAAGCCATGAACAACTTGTCGGACTTCCAGAAGGACAACCTGATTCCTGCCGCCAGC 

TTAAAAACACAAACCAGAAGAAGGAGCTGGAAGTGGACTGTGGCCTGGACAAGTCCA 

ACTGTGGCAACAGCAAAACCACACATTGGACTATAATCTGGCCCCAGGGCCCCTGGGG 

CGGGGGACCATGCCAGGAAGTTTCCCCACAGTGACAAGAGCTTAGGAGAGAAGGCGC 

CACTGCGGTTACACAGTGAAAAGCCAGAGTGCGGATATCAGCGATATGCTCCCCCAGG 

GACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAATGAATGTGTCATTGC 

CACGGAGGTATAA 
Figure 17 

MAAASRS ASGW ALLLLV ALWQQRAAGS GVFQLQLQEFINERGVLAS GRPCEP GCRIFHIV 
CUCHFQAVVSPGPCTFGTVSTPVLGTNSFAV^ 

HAPGDDLRPEALPPDALISKIMQGSIAVGQITWLLDEQTSTLTRLRYSYRVIC^ 

CSI^CKKRNDHFGHYVCQPDGNISC^ 
GWQGRLCbffiCIPHNGCRHGTCSTPWQCTCDEGWGGIEC^ 

SGORSYTCTCPJ»GYTGVDCTLEl^ECDSWCP^fGGSCKDQEDGYHCLGPPGYYGLHCEHS 
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GPSRMCRCRPGFTGTYCELHVSDCAWSIPCAHGGTGHDLEN^ 

VIXGMVAVAVRQIJEUJRRPDDGSR^ 
KSNCGKQQNHTT^YNLAPGPLG^^ 
DSMYQSVCLISEEKNECVIATEV 

Figure 18 

CTCGCAGGCTAGGAACCCGAC5GCCAAGAGCTGCAGCCAAAGTCAOT 
ACTCCCTCACTAGCCCGCTCGAGACCCTAGGATTTGCTCCAGGA^ 

GCCACC(3CCCAGTCGCCCTCACCTGGA1TACCTACCG^ 

TTGAGAAGGCGACAAGGGAGCAC^GTCCCG^ 

GGCAGACGGGACJTTGCGGGAGAGCGACATCCCTAACAAGCAGArTC 

GTGGAGAGGACACCCCAAGGGATGACGCCTGCGTCCCGGAGCGCCTGTCGCTCGGCGT 

ac?^?gSggtactgtggccgcagcagcg^ 
ctgcggctgcaggagttcgtcaaccagcgcgg^^ 

AACCGGGCTGCCGGACTITrCTTCCGCATTTGCCTT 

GAGGGACCCTGCACCTTTGGCAATGTCTCCACCX^^^ 

CGTCAGGGACAAGAATAGCGGCAGTGGTCC^^ 

ACCTGGCCGGGAACCTTCTCACTCAACATCCAAGCTTGG^ 

TGCGGCCAGAGACITCGCCAGGAAACTCTCTCATCAGCCAAATCAT^ 

TCTTGCHGTGGGTAAGATTTGGCGAACAGACGAGCAA^ 
AGCTACTCTTACCGGGTCATCTGCAGTGACAACT^^^ 

ATGCAAGAAGCGCGATGACCACTTCGGACATTATGAGTGCCAGCCAGATGGCAGCCTG 
TCATGAGCAGAATGGTTACTCCAGC^^ 

CAGGGTCGCCTGTGCAATGAATGTATCCCCCACAATGGCTGTCGTCATGGCACCTGCA 
GCATCCCGTGGCAGTCT^ 

GGCCAAAGGGTTATACCTGCACCTGTCTCCCAGGCTAC^ 

GGGACTCAGCAAGTGTGCCAGCAACCCCTGTCGAAATGGTGGCAGCTGTAAGGACCAG 
GAGAATAGCTACCACTGCCTGTGTCCCCCAGGCTACTATGGCCAGCACTGTGAGCATA 

™S™ CT CA€CC^ 
CAGGGGTCCAGTTATCTCCTGCGAATGCCCC 
GAAAGTAGACAGGTGTACCAGCAACCCGTGTGCCAATGGAG^^ 
AGGTCCAAGCCGAACCTGCCGCTGCCGGCCTGGATTCACAGGCACCCACTGTGAACTG 

CACATCAGCGATOTGC^ 

agaItgSSgtgcacctgccccgctg^ 

gataacccacgatgcctgtgcctccggaccctgcttcaatggggccacctgctacactg 

GC^GGTA^ 

GGACAACCTAATCCCTGCCGCCCAGCTCAAAAACACAAACCAGAAGA 

GTGGACTGTGGTCTGGACAAGTCCAATTGTGGCAAACT^ 

ACAATCTAGCCCCGGGACTCCTAGGACGGCXX^ 

tgacaagagcttaggagagaagtgccaotcg^^ 

GAATATCAGCCATITGCTCTCCCAGGGACTCTATGTA^ 
GAAGAGAGGAACGAGTGTGTGATTGCCACAGAGGTATAA^ 
ACCCAGCTCCGGCCCAGCAGCTGGGCCTTCCTTCTGCATTGT^ 
ATGGGACATCTTTAGTATGCACAGTGC^ 
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TMAGCTGG(XCCAAATGAGAAAQCAGAGAGGAGOCn'QCAAAGGATAGWCTCCC 
orrrGAGGAGGGGACACGTGTGCTCCTGCCTCCAACCCCAGCAGGTGGGGTGCCACCl 

TGTGTACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
Figure 19 

MTPASRSAQIWALLLIAVLW^ 

^t^oa^segpcttgWstp^ 

LTCADSPCR^ 
TraCRPGFTGTH^^^ 

sg?™gat?™gShSf^ 
qsv<xiseernecvtatev 
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Figure 20 



1 2 3 4 5 6 




Western blot analysis of Notch 2 expression in human germ cell tumour derived cell 
lines. 

Western blot probed with antibody specific for the intracellular portion of human 
NOTCH2 and visualised using chemilumenesence. Lanes from left to right 1 : Be Wo, l. 
TERA-1, 3: 833KE, 4: 2102 Ep 2A6, 5: 2102 Ep 4D3, 6: NTERA2/D1 8 days exposure 
to retinoic acid, 7: NTERA2/D1 EC cells. Molecular weight markers are indicated on the 
right in kDa. Notch2 protein product is visualized at apprx 100 kDa 
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Figure 21 



WO 02/077204 



19/66 



PCT/GB02/01195 



Figure 22 

AAACCCACTCCACCTTACTACCAGACAACCTTAGCCAAACCATTTACCCAAATAAAGT 
AI^GGCGATAGAAAT 

'aaaaa^ataaccaagcataatatagca^ 
aattaactag^ 

AGCTACCTAAGAACAGCTAAAAGAGCACAC^ 

ATTTATAGGTAGAGGCGACAAACCTACCGAGCCTGKjTGATAGCTGGTTGTCCAAGATA 
GAATOTAGTrCAACTTTAAATTTGCCCACAGAACCCTCTA^ 

actctSgtccaaagaggaacagctctitgg 

GTGTCAGCCCAATTCCACACTTTTCCACATGTTGGATGGCCTTGG^ 
r^ATTTTTGGAATTCAACTAAAAACTGAAGGATCCTTGAGGACGGCAGTACCTGGCAT 

A^C^AC^TC^^^ 
CGAGTGATCTGTGACAATATCCCTGGTT^^ 

GTTACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTG 

TCAGCACCAATTCCGCCACCACCGCTGGAACTGTACCACCCTGGACCGGGAC^^CACC 

GTCTITGGCCGTGTCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTC 

ATCAGCAGGGGTGATCCACGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTG 

TGCAGCTGTGACCCCTACACCCGTGGCCGACACCATGACCAGCGTGGGACTTTTGACT 

GGGGTGGCTGCAGTGACAACATCCACTACGGTGTCCGTTTTGCCAAGGCCT^ 

GCCAAGGAGAAGAGGCTTAAGGATGCCCGGGCCCTCATGAACTIACATA 

GTGGTCGCACGGCTGTGCGGCGGTTTGTCAAGCTGGAGTGTAAGTGCCATGGCGTGAG 

TGGTTCCTGTACTCTGCGCACCTGCTGGCGTGCACTCTCAGATTTCCGCCGCACAGGTG 

ATTACCTGCGGCGACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCAAGA1GGTGC 

CAACTTCACCGCAGCCCGCCAAGGCTATCGCCGTGCCACCCGGAGTGATCTTGTCTACT 

TTGACAACTCTCCAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCTAGGCACTGCA 

GGCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCATGTGCTGTG 

GCCGAGGGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGTGAGTGCAAATTCCA 

CTGGTGCTGTGCTGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGC 

AAAGCCCCCAAGAAGGCAGAGTGGCTGGACCAGAGCTGAACACACAGATACCTCACT 

CATCCCTCCAATTCAAGCCTCTCAACTCAAAAGCACAAGATCGTTGCATGCACACCTTC 

CTCCACCCTCCACCCTGGGCTGCTACCGCTTCTATTTAAGGATGTAGAGAGTAATCCAT 

AGGGACCATGGTGTCCTGGCTGGTTCCTTAGCCCTGGGAAGGAGTTGTCAGGGGATAT 

AAGAAACTGTGCAAGCTCCCTGATTTCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAG 

AGATAGGGGGTCTTTAGAGTGAAATGAGTTGCACTAAAGTACGTAGTTGAGGCTCCTT 

TTTTCTTTCCTTTGCACCAGCTTCCCGACACTTCTTGGTGTGCAAGAGGAAGGGTACCT 

GTAGAGAGCTTCrTTTTGTTTCTACCTGGCCAAAGTTAGATGGGACAAAGATGAATGGC 

ATGTCCCTTCrCTGAAGTCCGTTTGAGCAGAACTACCTGGTACCCCGAAAGAAAAATCT 

TAGGCTACCACATTCTATTATTGAGAGGCTGAGATGTTAGCCATAGTGGACAAGGTTCC 

ATTCACATGCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAAAGAATCATAACAAT 

ACAAACACACATTCATTCTCrCTTTTTCTCTCTACCATTCTCAACCTGTATTGGACAGCA 

CTGCCTCTTTTGCTTACTTGCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCCCATG 

CTTAACAGATCATTAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 
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Fig 23 



DSL 



ligands) 

\ 



???> 



Pickkopf 
Cerberus 



1 Dally 1 



IX i 

}CBF-l[ |]Dishevelled| 



— > Activation 
— | Inhibition 
»»> Transcription 



Extracellular 



Intracellular 




WO 02/077204 



21/66 



PCT/GB02/01195 



-2.4Kb 



Wnt 

u.f^^C WNT2B 



(Wnt-13) wmwrni 




WNT2B 
(Wnt-13) 



FRP 



■-4Kb — *3T SFRP 1 



-2.8Kb — . 

-2.6Kb - 
~1.5Kb - 

-1.9Kb - 
-13Kb - 



SFRP4 




if FRZB 



SFRP2 



-2Kb 



~6.5Kb 



~3.7Kb- 



Frizzled 



-3.5Kb- 



~1.9Kb 



2> ^ <\>- 



*&||i§¥ i FZD1 
FZD2 

FZD9 




I FZD4 



'4211111 FZD6 



"^"li^SSfj FZD7 



ffff 

IjjJ ^ «jj|f FZD3 



FZD6 



Figure 24 



WO 02/077204 



22/66 



PCT/GB02/01195 



~5Kb 
-2.5Kb 



Dishevelled 



~2.5Kb — 




DVL2 



DVL3 



~1.4Kb 



-3.5Kb 
-2.7Kb 



GSK-3p,Axin,APC 




GSK3B 



AXIN1 



~9Kb 




APC 



TCF 



-3.5Kb 




TCF1 



Figure 25 



WO 02/077204 



23/66 



PCT/GB02/01195 



Figure 26 

ACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCACGGTTTCGTGGGGA 

CCCAGGClTGCAAAGTGACGGTCATTTTCrCTTTCTTTCTCCCTCTTG 

ATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAGCGGCGG 

CTCTCGGCGGCCACCCTGTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATTCC 

AACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTG 

CAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGA 

CAACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGC 

GCTAGTCCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGC 

GCCGAAAACGCTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGG 

AATATGTGTGTCTTCTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTG 

AAAGCTITGGTAATGATCATAGCACGTTGGATGGGTATTGCAGAAGAACCACCTTGTCT 

TCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACT 

GTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTG 

AAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATA 

TTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATCA 

AGCCAGTAA1TCTTCTAGGCTTCACACTTGTCAGAGACACTAAACCAGCTATCCAAATG 

CAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATCAACTC 

AATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCTTCCAA 

AAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTGTGATTGCAGTAAATT 

ACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAACTTTTAATTA 

TTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTm 

GATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATTTCAGCTTA 

TAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAAGGAT 

CTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATTTT 

CTGAAATGTACTATCTTAATGCTTAAATTATAm 

AATAAAATTTAACATTTAATATCATGAAATGTTATAA 
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Figure 27 



CCTGATCCCCTACTAAAGCGCCTG^^ 



TTGACACATAAAGGATAGGAAAAGAGAGGAGAGAAAAG^ 

CTCCTGATC 
GCGGGGCA 

TTCCAGTTATTGCAGAAOTCCTGTCC^ 



GGAGCTGATGCGGGCCTCCTGATCAAT^ 



Trr^rGGGAGATGGCCGCGTTGAlGC^ 
Sr?cC&?GGTGCTGATCGTGGAGAGCT^^ 



CACGGTCATTACTCAAACCATGACTTGGGATGGCAGAATCTAGGAAGAC^CAC 

~ATGAA 
X3ATTTL/ 

AGKjGGGAAGTCTGTACCAAACAACGCAAGAAG^ 



agatgtcacatataAaagggcatgaaggagacccctg^ 

TGAAGGGTTITC5CTGTGCTCGTCATTTCTGGACCAAAA^ 



====== 

ACCTACATGCCAGATTTCTATTCAACGTTAGAGTTTAAC AAAAT ACTCCTA^ l 



TGTTATACAATAGXjTTCTAAAAATAAAATTGGTAAAGAAGAAATG 

^AATTACC 

TGGCCAGTTGTITAGGAAGGCCTTrAGGAAGACA^ 



TCCTOCTCTGAATGCATTA^^^ 
ACTTACACAGAGCAGAATTTTCACAGATCGCAAAAA^ 

f5Tfi<^AAAAGAGCTAACAGAGAGATCATTATTTCTTAAAGATTGGCCATAAC01ulAi 

S^^AGA^GGTAAATACATG^ 
mAGCTGGATCTGTACTGCACTGGAGTAAKAAG^^ 
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CCTGAAGTAGACAGAGTAGTGAGGTTTCATTGCCCTCTATAAGCTTCTGACTAGCCAAT 

GGCATCATCCAATTTTCTTCCCAAACCTCTGCAGCATCTGCTTTATTGCCAAAGGGCTA 

GTTTCGGTTTTCTGCAGCCATTGCGGTTAAAAAATATAAGTAGGATAACTTGTAAAACC 

TGCATATTGCTAATCTATAGACACCACAGTTTCTAAATTCTTTGAAACCACTTTACTACT 

TTITITAAACTTAACTCAGTTCTAAATACTTTGTCTG 

ATCTTATAGTCGTGACTTTAAACTTTrGTAGACCA^ 

CTTAAATCCCATCTGCAGTCTCAAATTTAAGTTCTCCCAGTAGAGATTGAGTTTGAGCC 

TGTATATCTATTAAAAATTTCAACTTCCCACATATATTTACTAAGATGATTAAGACTTA 

CATTTTCTGCACAGGTCTGCAAAAACAAAAATTATAAACTAGTCCATCCAAGAACCAA 

AGTTTGTATAAA.CAGGTTGCTATAAGCTTGGTGAAATGAAAATGGAACATTTCAATCA 

AACATTTCCTATATAACAATTAITATATTTACAATTTGGTTTCTGCAATATT^ 

GTCCACCCTTTTAAAAATTATTATTTGAAGTAATTTATTTACAGGAAATGTTAATGAGA 

TGTATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGAATATATTTGCAGCT 

ATTGACTTTGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAATTTTTCTCC 

TCTAAAAACTGAATTCAAAGC 



Figure 28 



ACACACAGGCGGCGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCT 

GGTGTGCCTGCTGCTGGCGGCGGCGGTCCCCACGGCeCCCGCGCCCGCTCCGACGGCG 

ACCTCGGCTCCAGTCAAGCCCGGCCCGGCTCTCAGCTACCCGCAGGAGGAGGCCACCC 

TCAATGAGATGTTCCGCGAGGTTGAGGAACTGATGGAGGACACGCAGCACAAATTGCG 

CAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTAAAGCATCATCAGAAGT 

GAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGACACGAAGGTT 

GGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGACTG 

GACAAATGGTCrTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAG 

GAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCC 

AGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACA 

GTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAG 

GGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGT 

GCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCT 

TTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATG 

GAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGC 

CTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCC 

TGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCC 

TGCGGCTGCCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTG 

TGGGTAGATGTGCAATAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTG 

GGCTGACCAGGCTTCTTCCTACATCTTCTrCCCAGTAAGTTTCCCCTCTGGCTTGACAGC 

ATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGG 

TGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACCCCTGTCCAGA 

TTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGGCGTTTGTTCTACATGGCTTTGAT 

AATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTGTCCCTCTGATTGGTTTTGGG 

GAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAA 

TGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCA 

r.ATc* a a TnTTOTrsTTr 1 a rrnw a tt a p a TrnYrTTT A TTP. A TC.C. A GG AGTGTTGCTCAG 
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fTCCTACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGC 

^AAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAACTACTGTTAG 
G^^CACACTGTG^GGCAGCCOTCCTrCT^ 

A4£CTGCAGAAACAGTACTTAGGTAATTGTAG0K3CGAG^ 

AA^TCACTTAGCAGCAA.CTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAG 
^AGGGCTGTGTG AAAC ATGGTTGTAAT ATGCG ACTGCGAAC ACT^ 

TCC^CAA^ 

TATAAAC^TAAGTTGCAITTAGAAATCAAGCATAAATCACTrCAACTGCT 



Figure 29 

GACAAACAGA.CGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTCGAGA 

GATCGTGGCGGCCGTCCT 

?CCTOAOTCAAC^CATCAGG^ 

ACAGTCCC^CT^ 

GAGAAGCCGTrCTGTGCTACATGTCGTGGGlTGCGGAGGAGGTGCCAGCGAGATGCCA 

rc A ATATTAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGC AGAAGGAACAACT 

GGGCACCCAGTCCAGGAAAACCAA 
GG?AGGAA^GACAAGA^AGAAAGTTGTCTGAG 

m^TC^CGTCAW 

gcgIctgt^otgg^ 

TCGATTAAGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAA 
TCCACATTGCATTTGAG 



Figure 30 

ATGGGGCTCTGGGCGCTGTTGCCTGGCTGGGTTC 
C^GCTCTGCCCGCAGCCCTGGCTGCCAACAGCAGTGGCCGATGGTGG^ 

GTAGCCTCCTCCACGAACCTGCTTACAGACTCCi^AGAGTCTC 

CAGTCTGCAGCTGTTGAGCCGCAAACAGCGGCGCCTGATA^ 

CTGCACAGCGTGAGTGGGGGGCTGCAGAGTGCCGTGCGCGAGTGCAAGTG^ 

GGAATCGCC(3CTGGAACTGTCCCAC^^ 
AACCGAGGCTGTCGAGAAACGGCGTTTATC™^^ 

rA^^GGCCCCGGGGGCCCCGACTGGCACTGGGGGGGCTGCAGCGACAACATTGACT 
TCGGCXGO^CT^ 

cScA^S 

,/-^T^r,rru-r<TTOr^GATGTGCTGCGCGACCGCTTCGACGGCGC 
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ctcgcgcgtcctgtacggcaaccgcggcagcaaccgcgcttcgcgaggggagctgctg 
cck:ctggagccggaagacccggcccacaaaccgccctccccccacgacctcgtctact 

TCGAGAAATCGCCCAACTTCTGCACGTACAGCGGACGCCTGGGCACAGCAGGCACGGC 

agggcgcgcctgtaacagctcgtcgcccgcgctggacggctgcgagctgctctgctgc 

GGCAGGGGCCACCGCACGCGCACGCAGCGCGTCACCGAGCGCTGCAACTGCACCTTCC 
ACTGGTGCTGCCACGTCAGCTGCCGCAACTGCACGCACACGCGCGTACTGCACGAGTG 

TCTGTGA 



Figure 31 

MGLW ALLPGWVS ATLIXALAALP AALAANS SGRWWGIVNVASSTNLLTDSKSLQLVLEPS 

LOIXSRKQPJILIRQNPGILHSVSGGLQSAV^ 

RETAFIFi^SAGVTHSVARSCSEGSIES 

FVDSGEKGRDLRFLMNLHNNEAGRTTWSEMRQ^^ 

GDVLRDRFDGAJSRVLYOMIGSNRASRAEI^^ 

I&GTAGTAGRACNSSSPAII)GCELLCCGRGIIRTRTQ 

RVLHECL 



Figure 32 

AGCAGAGCGGACGGGCGCGCGGGAGGCGCGCAGAGCTTTCGGGCTGCAGGCGCTCGC 
TGCCGCTGGGGAATTGGGCTGTGGGCGAGGCGGTCCGGGCTGGCCTTTATCGCTCGCT 
GGGCCCATCGTTTGAAACTTTATCAGCGAGTCGCCACTCGTCGCAGGACCGAGCGGGG 
GGCGGGGGCGCGGCGAGGCGGCGGCCGTGACGAGGCGCTCCCGGAGCTGAGCGCTTC 

tgctctgggcacgcatggcgcccgcacacggagtctgacctgatgcagacgcaagggg 
gttaatatgaacgcccctctcggtggaAtctggctctggctccctctgctcttgacctg 

GCTCACCCCCGAGGTCAACTCTTCATGGTGGTACATGAGAGCTACAGGTGGCTCCTCCA 

GGGTGATGTGCGATAATGTGCCAGGCCTGGTGAGCAGCCAGCGGCAGCTGTGTCACCG 

ACATCCAGATGTGATGCGTGCCATTAGCCAGGGCGTGGCCGAGTGGACAGCAGAATGC 

CAGCACCAGTTCGGCCAGCACCGCTGGAATTGCAACACCCTGGACAGGGATCACAGCC 

TTTTTGGCAGGGTCCTACTCCGAAGTAGTCGGGAATCTGCCTTTGTTTATGCCATCTCCT 

CAGCTGGAGTTGTATTTGCCATCACCAGGGCCTGTAGCCAAGGAGAAGTAAAATCCTG 

TTCCTGTGATCCAAAGAAGATGGGAAGCGCCAAGGACAGCAAAGGCATTTTTGATTGG 

GGTGGCTGCAGTGATAACATTGACTATGGGATCAAATTTGCCCGCGCATTTGTGGATGC 

AAAGGAAAGGAAAGGAAAGGATGCCAGAGCCCTGATGAATCTTCACAACAACAGAGC 

TGGCAGGAAGGCTGTAAAGCGGTTCITGAAACAAGAGTGCAAGTGCCACGGGGTGAG 

CGGCTCATGTAGTCTCAGGAGATGCTGGCTGGCCATGGCCGACTTCAGGAAAACGGGC 

GATTATCTCTGGAGGAAGTACAATGGGGCCATCCAGGTGGTCATGAACCAGGATGGCA 

CAGGTTTCACTGTGGCTAACGAGAGGTTTAAGAAGCCAACGAAAAATGACCTCGTGTA 

TTTTGAGAATTCTCCAGACTACTGTATCAGGGACCGAGAGGCAGGCTCCCTGGGTACA 

GCAGGCCGTGTGTGCAACCTGACTTCCCGGGGCATGGACAGCTGTGAAGTCATGTGCT 

GTGGGAGAGGCTACGACACCTCCCATGTCACCCGGATGACCAAGTGTGGGTGTAAGTT 

CCACTGGTGCTGCGCCGTGGGCTGTCAGGACTGCCTGGAAGCTCTGGATGTGCACACA 

TGCAAGGCCCCCAAGAACGCTGACTGGACAACCGCTACATGACCCCAGCAGGCGTCAC 

CATCCACCTTCCCTTCTACAAGGACTCCATTGGATCTGCAAGAACACTGGACCTTTGGG 

TTCTTTCTGGGGGGATATTTCCTAAGGCATGTGGCCTTTATCTCAACGGAAGCCCCCTC 

r^~~~„„^„^^„r,~n k * -mnnnr^nr irAPr^TOTArrTA A A GGCT ACCCTAT 
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TCTATCCATCTCCTGGTGTTCTGCA 

AGCATGACAGGCTGTTCAGCCGGGAGGGTGGTG^CCCAGACCACT^ 

SSE^QACnTTGI^ 
ATTACCTCKjAGAAGAATGGCTITCAATACCC^ 

CTATGGTGTGTTGTATCTTTGTAAGAGCAAAAGCCTCAGAAAGGGATTGCTTTGCATTA 

CTGTCCCCTTGATATAAAAAATCTITAG 
AAGGGAATTAAAAAGAAGATGAATGGTCTGGCAATATTCT 

TGGTGGAAAATAATTTAGTGGATGGAATATCAGAAGTATATCTGTACAGATCAAGAAA 
AAAAGGAAGAATAAAATTCCTATATCAT 

Figure 33 •• ■ ■ ■ ' 

MNAPIXKjIWLWIJ'IXLTWLTPEVN 

PDVMRAISQGVAEWTAECQHQFRQHRWNCKILDRDHSLFGRV^ 
VWAITRACSQGEVKSCSCDPKKMGSA 

RKYNGMQVVMNQDGTGFTVANERFK^ 
^TSRGMDSCEVMCCGRGYDTSHVTRMIKCGCKFHW 

NADWTTAT 

Figure 34 

CGGGAGTCTTCGGGGAGCTATGCTGA^ 

CCGCTTCGGCGCGCCAGCGCCCCGGTCCCTGTGCCGTCGCCCGCGGCCCCCGACGGCTC 

CCGGGC^CGGCCC^CTAGGTCTTGCCTGCCTTCTGCTCCT 

CGGCCCGCGTAGACACGTCCTGGTGGTACATTGGGGCACT 

TGACAATATCCCTGGTTTGGTGAGCCGGCAGCGGCAGCTGTGCCAGCGTTACCCAGAC 
ATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTGTCAGCACCAAT 

TCCGCCACCACCGCTGGAACTGTACCACCCTGGACC^ 

GTCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTCATC 

TAGTCCACGGTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTGTGCAG^ 

CCCTACACCCGTGGCCGACACCATGACCAGCGTGGGGACITTGACTGGGGTGOT 

GTGACAACATCCACTACGGTGTCCGTlTrGCCAAGGCCT^ 

GAGGCTTAAGGATGCCCGGGCCCTCATGAACTTACATAATAACCGCTGTGGTCGCACG 

GCTGTGCGGCGGTTTCTGAAGCTGGAGTGTAAGTGCCATGGCGTGAGTGGTTC^ 

TCTGCGCACCTGCTG(3CGTGCACTCTCAGATTTCCGCCGCACAGOTCA™ 

GACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCAAGAT 
AGCCCGCCAAGGCTATCGCCGTGCCACCCGGACTGATCTTGTCTACTTTGACA^ 

CAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCT^ 

AGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCA^ 

GACACAACTCGAGTCACCCGTGTTACCCAGT^^^ 

TGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGCAAAG 

AAGGCAGAGTGCK^GACCAGACCTGAACACACA^^ 
CA^GCCTCTCAACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCT 

CCTGGGCTGCTACCGCTTCTATT^^ 
TCCTGGCTGGTTCCTTAGCCCTGGGAAGGAGTTGTC^ 

AGCTCCCTGATTrCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAGAGATAGGGGGTCT 
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ACCAGCTTCCCGACACTTCTTGGTGTGCAAGAGGAAGGGTACCTGTAGAGAGCrTCTTT 

TTGTTTCTACCTGGCCAAAGTTAGATGGGACAAAGATGAATGGCATGTCCCTTCTCTGA 

AGTCCGTTTGAGCAGAACTACCTGGTACCCCGAAAGAAAAATCTTAGGCTAGCACATT 

CTATTATTGAGAGCCTGAGATGTTAGCCATAGTGGACAAGGTTCCATTCACATGCTCAT 

ATGTTTATAAACTGTGTTTrGTAGAAGAAAAAGAATCATAACAATACAAACACACATT 

CATTCTCTCTTITTTCTCTCTACCATTCTCAACCTGTATTGGACAGCACTGCCTCTm 

TACTTGCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCCCATGCTTAACAGATCAT 

TAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 



Figure 35 

MLDGLGWAIS1FGIQLKTEGSIJRTAWG1PTQSAFNKCLQRYIGALGARVICDNPGLVSRQ 

RQLCQRYPDIMRSVGEGAREW3RECQHQFRHHRWNCTTIJDRDHTWG 

VYAISSAG\THAITRACSQGEI^VCSGDPYTRG^ 

VDAKEKRLKD ARALMNIJIr^CGRTAWRFA/KLECK^ 

GDYIRERYDGAVQVMATQDGAb^AARQGYP^ 

TAGRVCSKTSKGTOGCEMCCGRGYDTTRVTRWQCECB^FHW 

CKAPKKAEWLDQT 



Figure 36 

GCGCTrCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACTTTGTTCCAACTATT 

GGGGGCGTCGCTCCCCCTCYTCATGGTCGCGGGCAAACTTCCTCCTCGGCGCCTCTTCT 

AATGGAGCCGCACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCC 

TCGCTGGCTACCCAATTTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGC 

TCACAGCCCCTGCTCTGCGGCTCCATCCCAGGCCTG<3TCCCCAAGCAACTGCGCTTCTG 

CCGCAATTACATCGAGATCATGCCCAGCGTGGCCGAGGGCGTGAAGCTGGGCATCCAG 

GAGTGCCAGCACCAGTTCCGGGGCCGCCGCTGGAACTGCACCACCATAGATGACAGCC 

TGGCCATCTTTGGGCCCGTCCrCGACAAAGCCACCCGCGAGTCGGCCTTCGTTCACGCC 

ATCGCCTCGGCCGGCGTGGCCTTCGCCGTCACCCGCTCCTGCGCCGAGGGCACCTCCAC 

CATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTGGCGAAGGCTGGAAGTGGGGC 

GGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCGGATGCGC 

GCGAGAAGAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGGCC 

GCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAG 

CTGTGAGGTGAAGACCTGCTGGTGGGCGCAGCCTGACTTCCGTGCCATGGGTGACTTCC 

TCAAGGACAAGTATGACAGCGGCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCG 

AGGCTGGGTGGAGACCCTCCGGGCCAAGTACTCGCTCTTCAAGCCACCCACGGAGAGG 

GACCTGGTCTACTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGTT 

CCTiTGGCACAAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGA 

TCTGCTCTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAAGCGGAAGGAAAAATG 

CCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGCCAGGAGTGTATTCGCATCTACG 

ACGTGCACACCTGCAAGTAGGGCACCAG 
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MBEHIIXHIIXHIIXXSTRVL^ 
YIEIMPSVAEGVKLGIQECQHQFRGRRWNC^ 

AFAVTRSCAEOTOTCG^ 

SHGID GCDLLCCGRGHNTRTEKRKEKCHCI 



.Figure 38 

ATGAGTCCCCGCTCGTGCCTGCGTTCGCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCC 
GCCGCGAGCAACTGGCTGTACCTGGCCAAGCTGTCGTCGGTGGGGAGCATCTCAGAGG 

AGGAGACGTGCGAGAAACTCAAGGGCCT 
GGAACCTGGAAGTCA^GGACTCGGTGCG 

TCGGCAAGGTGGTGACGCAAGGGATTCGGGAGGCGGCCTTGGTGTAGGGC^ 
GGCAGGTGTGGCCTITGG AGTGACGCGGGCGTGC AGCAGTGGGGAGCTGGAGAAOTGC 

GGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAGGGCTTC 

CTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTGTGGATGTGCGGGAGAGA 

AGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCA^ 
GGAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGTCAGGCTC 

CTGTG^TAAAGACGTGCTGGCGAGCCGTGCCGC 

CTGAAGGAGAAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGGTOCTCCA 
GGGCACTGGTGCCACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACTTCGTGTA 

CTTOGAGCCTAGCCCCGACTrCTGTGAGC 

GTGGCCGCGGCTTCCACACGGCGCAGGTGGAGCTGGCTGAACGCTGCAGCTGCAA^ 
CCACTGGTGCTGCTTCGTCAAGTGCCGGCAGTGCCAGCGGCTCGTGGAGTTGCACACG 

TGCCGATGA 
Figure39 

MSPRSCLRSLRIXWAX^AAASN^^ 

EVMDSVRRGAQIAIEECQYQF^ 

FAVIRACSS^ 

RAllv'INLHNNEAGRKAILTHIVlRVECKCHGVS GS CEVKTC WEAWPFRQVGHALKEKFD G 
ATEVEPRRVGSSRALWRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGmGRTCNKTS 

KAID G CELLCCGRGFHT AQ VEL AERCS CKFHW CCFVKCRQCQRLVELHTCR 



Figure 40 

ATTAATTCTGGCTCCACTTGTTGCTCGGCCCAGGTTGGGGAGAGGACGGAGGGTGGCC 
GCAGCGGGTTCCTGAGTGAATTACCCAGGAGGGACTGAGCACAGCACCAACTAGAGA 
GGGGTCAGGGGGTGCGGGACTCGAGCGAGCAGGAAGGAGGCAGCGCCTGGCACCAGG 
GCTTTGACTCAACAGAATTGAGACACGTTTGTAATCGCTGGCGTGCCCCGCGCACAW 

ATCCCAGCGAAAATCAGATTTCCTGGTGAGGTTGCGTGGGTC 
GAAACTGCCTATATCTTGCCATCAAAAAACTCACGGAGGAGAAGCG^ 
AGTAAACTTAAGAGACCCCCGATGCTCCCCTGGTTTAACITGTAT 
TGAGAGGGAATAAACATCTTTTCCTTCTrCCCTCTCCAG^^ 

mnrzr. a Tnacmo A A GTGT.A A TGTCTTCC AAGTTCTTCCTAGTGG 
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CTTTGGCCATATTTTTCT^ 

gtgaWagataggcagccgcgagacggccttcacatac(x:cgtgagcgca^a^ 
ctgSggccatga^ 

GCCGCGCCGCGC0K:CCCAA<3GACC^^ 

CAACATCGACTATGGCTACCGCTTTGCCAAGGAGTTCGTGGACGCCCGCGAGCGGGAG 

ACGAGGCCGGCCGCAGGACCX3TGTACAACCT^ 
GGTGTCCGGCTCATGTAGCCTGAAGACATGCTGGCT^ 
GTGGGTGATGCCCTGAAGGAGAAGTACGACAGCGCGGCG^ 
CGGGGCAAGTTGGTACAGGTCAACAGCCGCTTCAACT 

TCTACATCGACCCCAGCCCTGACTACTGCGTGCGCAATGAGAGCACCGGCTCGCTG^ 

CACGCAGG^ 
ScG^CGT^ 

AGTTCCACTGGTGCTGCTACGTCAAGTGCAAGAAGTGCACGGAGATCGTGGACCAGTT 
TGTCTGCAAG^ 

SIgaaaotacagt^ 

AA^TGCAACC^^ 

AATATTATAATTATTATTTGGCAATAATGGGGGTGGGAACCACGAAAAATATTTATTTT 

AATAACACATACGCTAACTTAGCTGTGTGGGACATGGTACACATCCAGAAGGTAAAGA 
AATACATTTTCITITICTCA^ 

GAGGGTGGTAGAAATCTATTCACAATTCAGGTTCTATGACCAAAATC 

TCTGGTGGAAGATAAAAGGTCTTGGGAAAACAAAACAAAACAAAACAAA 

rCCAGCAGGGCTGCTAGCTTGCmCTGCATTTTCAAAATGATAATTTACAATGGAAGG 

ACAAGAATGTC^^ 
ATCCAm^ 

ggaaa^gccTgaaat^ 

CTGTTATAAGAATTGGGATTCCAGATTTGTAAA 
GATTTTTTGTTTGGGGAGGTTGGCTTGAACATAAATGAAAT^ 

GATACTnKjTTAGTAAATTATAATAGTAGA 

CTCAGCCCAAGCAACAAGGTAATTGCGTGCCATTCAGCAC^ 

CrATTTGAGGAAAAACAOTGAAATCCACCrrCCTCTTCACACT^ 

ctccgtgttgtgatgtgatgctggg^ 

TTGGrTGTAGGACAGGAAATGAAACATTAGGAGCTCTGCTTGGAAAACAGTTCACTAC 

TTAGGGArrmGTTTCCTAAAACTTTTATTTTGAGGAG 
TGACAGAACTTGGCTAATGGAATTCACAGAGGTGTTGCAC^ 

TGTGTTTAGATTATCCACTCATGCTTCTCCTATTGTACTGCAGGTGTACCTTAAAACTOT 
TCCCAGTGTACTTGAACAGTTGCATTTATAAGGGGGGAAATGTGGTTTAATC^T 
ATATCTCAAAGTCTmGTACATAACATATATATATATATACATA^M 
TATAAATATATCTCATTGCAGCCAGTGA^ 

TCTGTCTAGAGCATTGTTGTCCTTCACTGCAGTCCAGTTGGGATTATTCCAAAAG 
TGAGTCTTGAGGTTGGGCTGTGGCCCCGCTGTGATCATACCCTGAGCACGACGAAGCA 

ACCTCGTTTCTCAGGAAGAAGCTTGAGT^ 

gatatct^ 

GTGGAGAGGGCATTACTTGTTCGTTATAGACA 

AGAAGCATCAGCAATGTTTCTCTTTTCTT^ 

TATTAGAAATGACAGTACTTATTAATTGAGTC 

~ . ~ . ^^^t^t^t^o^t^^^ ATA A nCrA n ACCTCTTTCCAAACAGGCCATCA 
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AAG 

Figure 41 

S^L^G^S^ScCGRGYDQFKTVQTmCHCKFHWCCYVKCKK 
CTEIVDQFVCK 



Figure 42 

GGCACGAGCGCAGGAGACACAGGCGCTGGCTG^ 

CCGG 

CTTCCGCCGCTGGAA^TC^^^^ 



TTCGGGGACGAGAAGTCGAGGCTCTTTA^^ 

GACATCCGCGCGTTGGTGCAACTGCACAACAACGAGGCGGGCAGGCTGG 
AGCCACACGCGCACCGAGTGCAAATGCCACC^ 

CCTGCTGGCAGAAGCTGCCTCCATTTCGCGAGGTGGGCGCGCGGCTCCTO 

CCACGGCGCCTCACGCG^^ 
C5SCGCICAAG(XXKXX3GC3C^ 
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GAGAGCGTGCACKTCGAAGAGAACTGCCTGTGCCGCTTTC 

AGTGCCACCGTTGCCGTGTGCGCAAGGAGCTCAGCCTCTGCCTGTGACCCGCCGCC 
CGGCCGCTAGACTGACTTCGCGCAGCGGTGGCTCGCACCTCTGGGACCTCA 

GGCACCGGGCGCCTCTCGCCGCTCGAGCCCAGCCTCTCCCTGC^ 

AGGGCTCTGGAAATGGTGAGGCGAGGGGCnTGAGAGGAACGCCCACCCACGA 

CAGGGCGCCAGACGGGCCCGAAAAGGCGCTCGGGGAGCGTTTAAAGGACACTGTACA 

GGCCCTCCCTCCCCTTGGCCTCTAGGAGGAAACAGTTTTTTAGACTGGAAAAAAG 

GTCTAAAGGCCTCTGGATACTGGGCTCCCCAGAACTGCTGGCCACAGGATGGTGGGTG 

aggttagtatcaataaagatatttaaao:aaaaaaaaaaaaaaaaaaaaa 

Figure 43 

MU>PLPSRLGLLLLLLLCPAHVGGLWAVGSPL^ 

WAELARGARLGVP^CQFQFRFRRWNCSSHSKAFGRILQQDIRETAPVFAITAAGASHAVT 

OACSMGEIXQCGCQAPRGRAPPRPSGIJGTPGPPGPAGSPEGSAAWEWCJGCGDDWGD 

EKSRLFMDARHKRGRGDIRALVQLHNNEAGRIAVRSHTR.TECKCHGLSGSCALRTCWQK 

LPPFREVGAPXLERFHGASRVMGTNDGKALIJ'AVRTIKPPGRADI^ 
TGSPGTRGRACNSSAPDI^GCDLLCCGRGHRQESVQI^ENCLCRFHWCCVVQCHRCRVRK 

ELSLCL 



Figure 44 

CACGCGTCCGGGCCAATCGGGACTATGAACCGGAAAGCGCTGCGCTGCCTGGGCCACC 

TCTTTCTCAGCCTGGGCATGGTCTGCCTCCGGATCGGTGGCTTCTCCTCAGTGGTAGCTC 

TGGGCGCAACGATCATCTGTAACAAGATCCCAGGCCTGGCTCCCAGACAGCGGGCGAT 

CTGCCAGAGCCGGCCCGACGCCATCATCGTCATAGGAGAAGGCTCACAAATGGGCCTG 

GACGAGTGTCAGTTTCAGTTCCGCAATGGCCGCTGGAACTGCTCTGCACTGGGAGAGC 

GCACCGTC1TCGGGAAGGAGCTCAAAGTGGGGAGCCGGGACGGTGCGTTCACCTAOT 

CATCATTGCCGCCGGCGTGGCCCACGCCATCACAGCTGCCTGTACCCATGGCAACCTG 

AGCGACTGTGGCTGCGACAAAGAGAAGCAAGGCCAGTACCACCGGGACGAGGGGTGG 

AAGTGGGGTGGCTGCTCTGCCCiACATCCGCTACGGCATCGGCTTCGCCAAGGTCmGT 

GGATGCCCGGKjAGATCAAGCAGAATGCCCGGACTCTCATGAACTTGCACAACAACGA^j 

GCAGGCCGAAAGATCCTGGAGGAGAACATGAAGCTGGAATGTAAGTGCCACGGCGTG 

TCAGGCTCGTGCACCACCAAGACGTGCTGGACCACACTGCCACAGTTTCGGGAGCTGG 

GCTACGTGCTCAAGGACAAGTACAACGAGGCCGTTCACGTGGAGCCTGTGCGTGCCAG 

CCGCAACAAGCGGCCCACCTTCCTGAAGATCAAGAAGCCACTGTCGTACCGCAAGCCC 

ATGGACACGGACCTGGTGTACATCGAGAAGTCGCCCAACTACTGCGAGGAGGACCCGG 

TGACCGGCAGTGTGGGCACCCAGGGCCGCGCCTGCAACAAGACGGCTCCCCAGGCCAG 

CGGCTGTGACCTCATGTGCTGTGGGCGTGGCTACAACACCCACCAGTACGCCCGCGTG 

TGGCAGTGCAACTGTAAGTTCCACTGGTGCTGGTATGTCAAGTGCAACACGTGCAGCG 

AGCGCACGGAGATGTACACGTGCAAGTGAGCCCCGTGTGCACACCACCCTCCCGCTGC 

AAGTCAGATTGCTGGGAGGACTGGACCGTTTCCAAGCTGCGGGCTCCCTGGCAGGATG 

CTGAGCTTGTCTTTTCTGCTGAGGAAGGTACTTrTCCTGGGTTTCCTGCAGGCATCCGTG 

GGGGAAAAAAAATCTCTCAGAACCCTCAACTATTCTGTTCCACACCCAATGCTGCTCCA 

CCCTCCCCCAGACACAGCCCAAGTCCCTCCGCGGCTGGAGCGAAGCCTTCT 

GAACTCTGGACCCCTGGGCCTCATCACAGCAATATTTAACAATTTATTCTGATAAAA^ 

AATATTAAirrATTTAATTAAAAAGAATTCTTCCACCTCAAAAAAAAAAAAAAAAAAA 

AAAAAAAAGGGGGG 



T-!Z „ AC 
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MKRKARRCLGHIJT^LGMVyi^GGFSSWALGAS 



rask^ 

GCDIMCCGRGYNTHQYAJRWQCNCKIOTCCYVKOTCSERTEMYTCK 



Figure 46 



MHRNFRKWlFyVFI^ 

EGAQMGlMiCQYQFRFGRWNCSALGEKTWGQEIJl^ 
CSO^G^NCGCTREKQGYYNQAEGWKWGGCSADVRYGIDFSRRFW 



VVRASR1RQPTFLRIKQLRSYQKPMETDLVYIEKSPNYCEEDAATGSVGT 
ADGCDTMCCGRGYNTHQYTKWQC^CKFHWCCFVKCNTCSERTEVFTCK 



Figure 47 

TrrrK^TTACACACCAAGGAAAGTTGGGGTTTGAAGAATTCCATCCCCATGGCCACTGG 

===== 

atgcaagta^ 

tSStggtgggotc^ 

GTTCTGCTGGAGTCATGTACACCCTGACTAGAAACTG^ 

TGTGGCTGTGATGACTCCCGCAACGGGCAACTG^GGACAA^ 

GCTGCAGTGACAATGTGGGCTTCGGAGAGGCGATTOGAAGCAG^ 

GGAAACAGGACAGGATGCACGGGCAGCCATGAACCTGCACAACAAC^ 

CAAGGCGGTGAAGGGCACCATGAAACGCACGTGTAAGTGCCATGGCGTGTCTG^AGC 

TGCACCACGCAGACCTGTTGGCTGCAGCT 

toaIgSgtaccacgcagcac^^^ 

gcck:ggccgcccgcggcgccatcgccgacacctttcgctccatctctacccg^^ 

GCTGCGCACAAACCCGGGAGAAAACCCTAAGGGTTTCCTCTGCrcCCT(^TT^CCCAC 
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CTAGGAAATACGTTA^ 
TGCACCTCTAGA 



Figure 48 

IX3GQGWLWGGCSDNVGFGEAISKQFVD^ 

TCKCHGVSGSCTTQTCWLQLPEFKEVGAHIJ^KYHAAIXVDLLQGAGNSAAA^ 
EERRAETVSSC^CKFHWCCAWCEQCRERVTKYFCSRAERPRGGAAHKPGRXP 



Figure 49 

G0CKKXXKX5IO3ACGGAGGGGCT^ 
CGGTGAGAGCAAAGCCAGAGCCCCCAGTCCiTTOT 

TCACTCCCTCCCTTCCTCCCTCCC1TCCTCCCGGCGGCCGCGGCGGCGCTGGGGAAGCG 

GTGAAGAGGAGTGGCCCGGCCCTGGAAGAATGCGGCTCTGACAA^^ 
AGCGCAGTCTCCCCACGGTTTAAGCAGCACTAGTGAAGCCCAGGCAACCCAACCGTGC 

CTCTOCGGACCCCGCAC^CCAAA^ 

GGGTCTCCTGTTCCTGGCGTTGTGCAGTCGGGCTCTAAG^ 
CA^CTG^ 

ACTGGTCCGCGGTTGAGGGCGGCGGCCGCCTGCCGCACCACAGCGCCATCCTCAAGCG 
CGGTTTCCGAGAAAGTC 

TAGCCACGGCCTGCAGCCTGGGCAAGCTGGTGAGCTGTGGCTGTGGCTGGAAGGGCAG 

SagSggatcggctgagggccaaactg^^ 

AAGAGT1TCCCCCACTCTCTGCCCAGCCCTGGCCCTGGCTCAAGCCCCAGCC 

CCAGGACACATGGGAATGGGGTGGCTGTAACCATGACATGGACTITG^^^ 

TCTCGGGATTrCTTGK5ATTCCAGGGAAGCTCCCCGGGACATCCAGGCACGAAT(^GAA 

TCCACAACAACAGGGTGGGGCGCCAGGTGGTAA^^^ 
AGTGTCATGGCACATCAGGCAGCTGCCAGTTCAAGACATGCT^ 

GTTCCGGGCAGTGGGGGCGGCGTTGAGGGAGCGGCTGGGCCGGGCCATCTTCA^GAT 

ACCCACAACCGCAATTCTGGAGCCTTCCAGCCCCGTCTGCGTCCCCOT 

AGAGCTGGTCTACTTTGAGAAGTCTCCTGACTTC 

OXXttGGGACAAGGG^^ 

CAGCCTGTGCTGTGGCCGTCKJGCACAACGTGCTCCGGCAGACACGAG^ 

CATTGCCGCTrCCACTGGTGCTGCTATGTGCTGTGTGATGAGTGCAAGG 

GGTGAATGTGTGTAAGTGAGGGTCAGCCTTACCTTCGGGCTGGGGAA 

r>. a n a nnnnrP^PTTTTr a rjrrrTTTGrTfTGATTTCCTTCCAAGGTCACTCTTGGTCCCT 
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aagaWgtatctagaaagaga^^ 

TG^GCCTGGGCTCCCTCTTGTC 
SaAAGAGACAGAGTCX:TGA<^^ 

CAGGCCCTTAGGGAAGTTGTCTCCTTCCATTCAGATGTTAATGGGGACCC^ 



AGTTACTGGAAAAAAAAAAAAAAAAAA 
Figure 50 



GSSPSPGPODTWEWGGCNHDMDFGEKFSRDFLDSREAPKDIQAR^ 



LAAGVMHAVATACSLGKLVSCGCG 
TWEWGC 

CHCRFHWCCYVLCDECKVTEWVNVCK 
Figure 51 

CGCGACGATGAGGGCGCGGCCGCAGGTCTGCGAGGCGCTGCT^ 
GTCATGAAGGCCTGTCCWCGGGCCmGCCGAC^^^ 

GCTCGCCCCCAACTATTTGCTTGACCTGGAGAGAGGGACCCG^GAGTCGGCC^CGjrG 



cccggctgctcct 
gatgctcctatgaaggtgaI^^ 



ACAAC^GTGAAGTGGGGAGACAGGCTCTGCGCGCCTCTCTGGAj^TG^OTCT^^rc 
CCATGGGGTGTCTGGCTCCTGCTCCATCCGCACCTGCrGGAAGGGGCT 

gtgggctcccacgggacacaagacaggcagtgcaacaagactc^ 

^^r^^rCGCTATGTCTOCASTOA^CCCTGCCCTCCGCCCCACGCAGGAGCGAG 
CTCTGOTGTGAATTCCAGATGCCAGGCATG^ 
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GTGTAAGGAGCGGGGCITGGGATCGGTGAGAC^^ 

ACAGAGACCAGCCTCCGGGAAGGGGTCTGCCCGCCTC 

CCCCTGGCCCACCCTGKjGGTCTGAGCCTGCTGGGCCACCACATGGAATCACT 

r^TOTAAATGTTTTCTmGTTIKITGCTTm 
AITITANATATATAAATAM^^ 

TTTAAGCAGCTGTATGAAATAAATGCTGAGTGAGCCCCAGC^ 

GGOCICGTCAAGTCAACnXXX^ 

CGGGCA 

Figure 52 

MRAKPQVCFALLFAIALQTC 
SNLELMHTVVEiA^ 

TRYlJsATKVVHRPMGTRKHLW 
Figure 53 

nGrGCGGCAAGATGCTGGATGGGTCCCCGCTGGCGCGCTGGCTGGCCGCGGCCTTCGG 
^GAGCrcCTGACCATCCTCCCGCTGACCCT 

rTACAAGGCGTGCGACCGGCTGAAGCTGGAGCGGAAGCAGCGGGGGATGTGCCGCCG 

gSgS 

CCAGTTCeAGTTCCGCTTTGAGCGCTGGAACTGCACGCTCGA 
AGCCTGCrCAA^GAGGCTTCAAGGAGACTGCCTTCCT^^^ 

cctgacgcacgcactggccaaggcgtgca<k:gcgggccgcatg^ 

GATGACKJCACCCGACCTGGAGAACCGTGAGGCCTGGCAGTGGGGGGGCTGCGGAGAC 
ATfTGCGAGCCCGTGTGGACTTCCACAACAACCTCGTGGGTGTGAAGGTGATCAAGGC 

TGCTGGCGGCAGTTGGCGCCTTTCCATGAGGTGGGCAAG 
AGACGGCACTCAAGGTGGGCAGCACCACCAATGAAGCTGCCGGCGA^ 

TCTCCCCACCAC(X3GCK;CGTGCCTCGGGGGCAOT^ 
TCCAGAGCTGGTGCACCTGGATGACTCGCCTAGCTTCTGCCTGGCTGGC^CGC 

CGGGCACCGCTGGGCGTAGGTGCCACCGTGAGAAGAACTGCGAGAGCATCT 
rrGTGGCCATAACACACAGAGCCGGGTGGTGACAAGGCCCTGCCAGTGCCAGGTGCGT 

?ggtotg^mgTgW 

gtgtgaagggtctacacctgcacaggctgagttcct^^ 

TGGGGTACAGGCATTOTCACACAGTGTGAATG^ 

TGGGCTCAGACCTAGCAGCGTGGGGTAGTCCCTGGGCTCAG^ 

GCAGGCATTGCACAGAGCATGAATGGGCCTACACCTGCCAAGGCTGAATC^ 

CAGCCAGCCCTGCTGCACATGGCACAGGCATrGCACACGGTGTGA^ 
GCAAGGGCTGAGGCCCTGGGCCCAGTCAC^CCTGCTGCTCA^ 
ATGGTGTGAGAAGGTCTACACCTGCAAGGGACGAGTCCCCGGGCCTGGC^ 
TGTGCAGGGTGAGGGCCATGCATGCTAGTATGAGGGGTCTACACCTGCAAGGACTGAG 

AGGCTTTT 
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Figure 54 

MIJC)GSPLARWIAAAFGLTIJXAALRPSAAYFGLTGS 
LKXERKQRRMCRRDPGVAETLVEAVSMSALECQFQFRFERW^ 

taflyaSsaglthalmagsagrmerctcdeapdle^awqwggcgdnlkysskf^ 

EFLGRRSSKDIJIARVDFHNNLVGVK^^ 

KHLKHKYBTAIJCVGSTTN^ 
GRFSPGTAGRRCHREKNCESICCGRGHrTTQSRVV^ 

TCKG 

Figure 55 

AGCCTGCAAAAA.CCACAGAGGGCAAAGCCAGAAAGATGGAAAGGCACCCACCCAT 

AGCTC ACC A.CTTGCCTCAGGGAGACCCTCTTCACAGGGGCTTCTCAAAAGACCTCCCTA 

TGGTGGTTGGGCATTGCCTCCTTCGGGGTTCCAGAGAAGCTGGGCTGCGCCAATTTGCC 

GGTGAACAGCCGCCAGAAGGAGCTGTGCAAGAC5GAAACCGTACCTGCTGCCGAGCAT 

CCGAGAGGGCGCCCGGCTGGGCATTCAGGAGTGCAGGAGCCAGTTCAGACACGAGAG 

ATGGAACTGCATGATCACCGCCGCCGCCACTACCGCCCCGATGGGCGCCAGCCCCCTC 

TTTGGCTACGAGCTGAGCAGCGGCACCAAAGAGACAGCATTTATTTATGCTGTGATC 

CTGCAGGCCTGGTGCATTCTGTGACCAGGTCATGCAGTGCAGGCAACATGACAGAGTG 

TTCCTGTGACACCACCTTGCAGAACGGCGGCTCAGCAAGTGAAGGCTGGCACTGGGGG 

GGCTGCTCCGATGATGTCCAGTATGGCATGTGGTTCAGCAGAAAGTTCCTAGATTTCCC 

CATCGGAAACACCACGGGCAAAGAAAACAAAGTACTATTAGGAATGAACCTACATAA 

CAATGAAGCTGGAAGGGAGGCTGTCGCCAAGTTGATGTCAGTAGACTGCCGCTGCCAC 

GGAGTTTCCGGCTCCTGTGGTGTGAAAACATGCTGGAAAACCATGTCTTCTTTTGAAAA 

gattggccatttgttgaaggataaatatgaaaacAgtatccagatatcagacaaaata 

AAGAGGAAAATGCGCAGGAGAGAAAAAGATCAGAGGAAAATACCAATCCATAAGGAT 

GATCTGCTCTATGTTAATAAGTCTCCCAACTACTGTGTAGAAGATAAGAAACTGGGAA^ 

CCCAGGGACACAAGGCAGAGAATGCAACCGTACATCAGAGGGTGCAGATGGCTGCAA 

CCTCCTCTGCTGTGGCCGAGGTTACAACACCCATGTGGTGAGGCACGTGGAGAGGTGT 

GAGTGTAAGTTCATCTGGTGCTGCTATGTCCGTTGCAGGAGGTGTGAAAGCATGACTG 

ATGTCCACACTTGCAAGTAACCACTCCATCCAGCCTTGGGCAAGATGCCTCAGCAA^IAT 

ACAATGGCATTGCAACCAGAGAGGTGCCCATCCCTGTGCAGCGCTAGTAAAGTTGACT 

CTTGCAGTGGAATCCC 
Figure 56 

MDRAALLGLARLCALWAALLVLFFYGAQGNWMWLG1ASFGWEKLGCANL 

CKRKPYLIPSIREGARI^IQECGSQFRHERWNCMITAAATTAPMGAS 

AFIYAVMAAGLVHSVTRSCSAGr^ECSCDTTLQNGGSASEGWHWGGCSDDVQYGMWF 

SPJCFII)FPIGNTTGKENKVLLAMNIJINNEAGRQAVAKLM 

TMSSFEOGHLLKDKYENSIQISDKTKRK^^ 

KLGIPGTQGRECNRTSEGADGCMXCCGRGYNTHW 

RCRRCESMTDVHTCK 



Figure 57 

AGTTGAGGGATTGACACAAATGGTCAGGCGGCGGCGGCGGAGAAGGAGGCGGAGGCG 
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CACTAGCGCGGCGCCGCCAGCCGGGAGCCAGCGAGCCGAGGGCCAGGAAGGCGGGAC 
.ACOACCaXXXXXXXXCTA^ 

CAAGTTTCCGCGGCGGCGGCGGCTGCGGTACGCAGAACAGGAGCCGGGGGAGC^CX: 

CGAAAGCGGOTGGGCTCGACGGAGCXXACCCGCGCAGAGG^ 

GGGAGCCGCCGCCGGCCGTGCCCCTGGCAGCCCCAGCGGAGCGGCGCCAAGAGAGGA 

GCCGAGAAAGTATGGCTGAGGAGGAGGCGCCTAAGAAGTCCCGGGCCGCCGGCGGTC 

GCGKX3AGCTGGGAACTTTGTGCCGGGGCGCTCTC 

CGGGGACGCCGGTGGCCGCCGCCGCCCGCCAGTTGACCCCCGGCGATTGGCGCGCCAG 

CTGGTGCTGCTGCTTTGGCTGCTGGAGGCTCCGCTGCTGCTGGGGGTCCGGGCCCAGGC 

GGCGGGCCAGGGGGCAGGCCAGGGGCCCGGGCCGGGGCAGCAACCGCCGCCGCCGC^ 

TCAGCAGCAACAGAGCGGGCAGCAGTACAACGGCGAGCGGGGCATCTCCGTCCCGGA 

CCACGGCTATTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCGTACAACCAG 

ACCATCATGCCCAACCTGGTGGGCCACACGAACCAGGAGGACGCGGGCCT(K3AGGTC 

ACCAGTTCTACCCTCTAGTGAAAGTGCAGTGTTCCGCTGAGCTCAAGTTCTTCCT 

TCGATGTACGCGCCCGTGTGCACCGTGCTAGAGCAGGCGCTGCCGCCCTGCCGCTCCCT 

GTGCGAGCGCGCGCGCCAGGGCTGCGAGGCGCTCATGAACAAGTTCGGCTTCCAGTGG 

CCAGACACGCTCAAGTGTGAGAAGTTCCCGGTGCACGGCGCCGGCGAGCTGTGCGTGG 

GCCAGAACACGTCCGACAAGGGCACCCCGACGCCCTCGCTGCTTCCAGAGTOTGGAC 

CAGCAACCCTCAGCACGGCGGCGGAGGGCACCGTGGCGGCTTCCCGGGGGGCGCCGG 

CGCGTCGGAGCGAGGCAAGTTCTCCTGCCCGCGCGCCCTCAAGGTGCCCTCCTACCTCA 

ACTACCAGTTCCTGGGGGAGAAGGACTGCGGCGCACCTTGTGAGCCGACCAAGGTGTA 

TGGGCTCATGTAGTTCGGGCCCGAGGAGCTGCGCTTCTCGCGCACCTGGATTGGCATTT 

GGTCAGTGCTGTGCTGCGCCTCCACGCTCTTCACGGTGCTTACGTACCTGGTGGACATG 

CGGCGGTTCAGCTACCCGGAGCGGCCCATCATCTTCTTGTCCGGKIITGTTACACCXXXG^ 

GGCCGTGGCCTACATCGCCGGCTTCCTCCTGGAAGACCGAGTGGTGTG^ 

TTCGCCGAGGACGGGGCACGCACTGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACC 

ATCCTCnTCATGATGCTCTACTTCTTCAGGATGGCCAGC 

TCGCTCACCTGGTTGCTGGCGGCTGGCATGAAGTGGGGCCACGAGGCCATCGAAGCCA 
ACTCACAGTATTTTCACCTGGCCGCCTGGGCTGTGCCGGCCATCAAGACCATCACCATC 
CTGGCGCTGGGCCAGGTGGACGGCGATGTGCTGAGCGGAGTGTGCTTCGTGGGGCTTA 

ACAACGTGGACGCGCTGCGTGGCTrCGTGCT^ 

cxx:acgtccittctgctggccggctttgtgtcgctcttccgcatccgcaccatcatgaa 

GCACGATGGCACCAAGACCGAGAAGCTGGAGAAGCTCATGGTGCGCATTGGCGTCTTC 

AGCGTGCTGTACACTGTGCCAGCCACCATCGTCATCGCCTGCTACTTCTACGAGCAGGC 

CTTCCGGGACCAGTGGGAACGCAGCTGGGTGGCCCAGAGCTGCAAGAGCTACGCTATC 

CCCTGCCCTCACCTCCAGGCGGGCGGAGGCGCCCCGCCGCACCCGCCCATGAGCCCGG 

ACTTCACGGTCTTCATGATTAAGTACCTTATGACGCTGATCGTGGGCATCACGTCGGGC 

TTCTGGATCTGGTCCGGCAAGACCCTCAACTCCTGGAGGAAGTTCTACACGAGGCTCA 

CCAACAGCAAACAAGGGGAGACTACAGTCTGAGACCCGGGGCTCAGCCCATGCCCAG 

gcctcggccggcxx:gcagcgatcccccaaagccagcgccgtggagttcgtgccaatcc 

TGACATCTCGAGGrrTGCTCACTAGACAACTCTCirrCGCAGGCTCCriTGAACAACTC 
AGCTCCTGCAAAAGCTTCCGTCCCTGAGGCAAAAGGACACGAGGGCCCGACTGCCAGA 
GGGAGGATGGACAGACCTCTTGCCCTCACACTCTGGTACCAGGACTGTTCGC^ATG 
ATTGTAAATAGCCTGTGTAAGATTTTTGTAAGTATATTTGTATTTAAATGACGACCGAT 

CACGCGTTTTTCTTmCAAAAGTTTTTAATTATTT 

tttccttcttgcccttttcggagtattcx:aaaggagctaaa^^ 

AGCGCTCCTGGTCGTCCTCGCGCGCCTCTCCCTACCACGGGTGCTCGGGACGGCTGGGC 
GCCAGCTCCGGGGCGAGTTCAGCACTGCGGGGTGCGACTAGGGCTGCCK:TGCCAGGGT 

CACTTCCCGCCTCCTCCTTTTGCCCCCHGCCCCTCCTTCT^ 

GCTTGAGGTAGGGGCTCTTAAGGTACAGAACTCCACAAACCTTCCA^ 

GGCCCCCATACATTACAATTCCTCCCTTGCTCGXX^GGTGGATTGCG^ 
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atS^gtccagcgtcatggaaaga^ 

P^^^w^^^^^^CCTCA^^TGAAGTGCTATTTTCTrATriTrAATCAAATAACTA 
CATA^AA^ 

GTGCCTGAAAATAATAGAGCTTGAGCTGTCTGAACTATTTTACATTTTATGGTGTC 
TAGCCAATCCCACAGTGTAAAAATTCA 

Figure 58 

hMEBEAPKKSI^GGGASWEI^G^ 

a^SSo^^^GIWSVLCCASTLFrVLTYLVDMl^^^m^T 



^^^S^AIPCPHI^AGGGAPPHPPMSPDFrVFMKYIMTlJVGmGFWI 
WSGKTLNSW RKFYTRLTNSKQGETTV 



Figure 59 

Saag^^^ 



IGCTGCIGGTGCCCG 

GGCTTCTGCCA( 
CATGCCCAACC 

TC rcCTCCAT<££S 



GGTGGACCAGTTCTATCCGCTGGTGAAGGTGCAGTGCTCGCCCGAACT 



ACCAGACCATCATGCC' 



GTGGGCCGAGCGCCTGCUU l /• a/ ^r>rvvinrjr , T A nr? A CC A CCGCGCCGCCGC 
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CGGGACTGCAGCCGGGTGCCGGGGGCACCCCGGGTGGCCCGGGCGGCG^G^GCTC 
rMGCGCTACGCCACGCTGGAGCACCCC™^ 

^CCCG^ 

tagaS^^^ 

accatggtgtcggtggcctacatcgcgggcttcgtgctccaggagcgcgtggtgtgca 

ACGAGCGCTTCTCCGACK5ACGGTrACCGCACGGTGGTGCAGGGCACCAAGAAQGAGG 

gc?gcaccatcctcttcatgatgctctaot 

CTcScCTGTcSScCTGGTTGCT^ 

CATCACCATCCTGGGCATGGGCCAGATCGACGGCGACCTGCTGAGCGGCGTGTGCTTC 

GTAGGCCTCAAC^CCTG^CCCG^ 

OTG^TCTC^^ 

^ATCATCA^^^^^^ 

C^GTOTCTGC^ 

CGAGCA^^ 

rATGATCAAATACCTCATGACGCTCATCGTGGGCATCACGTCGGGCTTCTGGATCTGGT 

CGGTGAGACCACCGTGTGAGGGACGCCCCCAGGCCGGAA^ 

GCCCGGGGTGGGGCCCCTACAGACTCCGTATTTTATTTTTTTAAATAAAA^ 

AACCATTTCACTTTTAGGTTGCTTTTTAAAAGAGAACTCTCTGCCCAACACCCCC 



Figure 60 

FT^SEDGYRTWQGTJOT^ 
FT T AGIA/SH^TIMKHDGTKTE^ 
YTRLTNSRHGETTV 
Figure 61 

GCCGCTCCGGGTACCTGAGGGACGCGCGGCCGCCCGCGGCAGGCGGTGCAGCCCCCCC 

CCACCCCTTCGAGCCAGGCGCCGGGGTCTGAGGATAGCA 

TGGAG^ACCTGTAACCTGAGATATITCAGTTGAA 

TGGAATCTGTGGTTTGGGAATGTGGTTGATCAACTTGATATGTTGGCCAAATGTGCCCC 

ATOTAATAAAATGAAAAGAAGAGACAAG^ 

AAAACAAACGCCTITrGTGAGACCAAGCTAACAAACCTCTGAC 

TAACTGTTTGAAGAAITrAACAGTAAGATACAGAAGAA^ 

GCAGGTGTATAAATATCTAAAATACATATTGAATAGGCC^^ 

AGACCCAGGAAGGATGGCTATGACTT(3GATTG^^ 

TCATGGGGCATATAGGTGGGCACAGTTTGTTTTCOT 

TGCCAAGATTTGCCTTATAATACTAC^ 



WO 02/077204 



42/66 



PCT/GB02/01195 



GAGCCATATCCTCGACTTGTG<^TCTGAAmAGCTG^ 

agScacS^g^^ataatam^g^^c^at^ 
aaaca1S™aca^aacgcScagagaatat<^ 

T^rATGAO^SATCACACATGGCACCAGCATGAATCGGGTTATTGAAGAAGATGGA 
TTATTCTTTGCCTTTTGCATGACrGATAGCn'GTACT 



CTACAGAlTCIxmX^CTGGAAAGGTAM 
GAACATCAAGGCATCCAAAACACTAAGAATT^ 



:tttc 

t a GGTT VFGA Ajl AGATAA^ATTTCTCT^ 

ACTG^TAAAGAGATGGGCATTGmCCCOA^ 
AAATATTACTTnTCTGGCTGTGrrmA^ 

TAAmGTA(3CCATCrmCCCATGTMTAGTATTGATTCATAGAGA^^ 
AATTTGCTTTGTGGAGGCATGTAATAAGATAAACATCATACATTATAAGGTAACCACA 

ATTACAAAATGGCAAAACA 



Figure 62 
MAMTWIWSLWPLT^ 

MEMFGVP 



.TVmGmGGHSIJSCEPITLRMCQDLPYNTTTMPl^LNHYDQQTAAL 
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WPEDMECSKFPDOT 

iSSw^WKWGSEAIEKKALLFHASAW 

LVVIGCYFYEQAYR^ 
W^SKKTCFEWA^^^ 

sSTQI^VDDQRSKAGSmSKVSSYHGSimSRDGRTO 
LTDHSPJKSSHRLbffiQSPJISSIRDI^^ 

Figure 63 

GCTGCGCAGCGCTGGCTGCTGGCTGGCCTCG 

^Sg^ctc^g^ 

cagctgcagttcttcctitgttctgtttatgtgccaatgtgca^ 
ccccattggcccatgcggcggcatgtgtctitcagtcaagag 

TGAAGGAATTTGGATtTGCCTGGCCAGAGA 

gaacgaccacaaccacatgtgcat^^ 

CACAAAACCCCCATCCAGCCTGGGGAAGAGTGTCACTCTGTGGGAACCAATTCTGATC 

actacaTctgggtgaaaaggagcct^^ 

OTATACAGCCGCTCA^CAA^ 
^GTGmCATCTCCACTGCCTTCACA^ 

CCTACCCTGAGCGCCCCATCATATTTCTCAGTATGTGCTATAATAm 

ATATTGTCAGGCTGACTGTAGGCCGGGAAAGGATATC 

AGAACCTGTTCTCATCCAAGAAGGACITAAGAACACAGGATGTGCAAT^ 

TGATCTACTTTTTTGGAATGGCCAGCTCCATTTG 
^mCAGCAGG^ 

GGTGGATGCAGATGAACTGACTGCiCTTGT^ 
CTCACCGGGTTCGTGGTGGCTCCCCTCTTTACTTA^ 

r^TGCAGGm^ 
A^A^ 

AGTTCCTGCAACGTGTGTGATTCXX^^ 

Trrrr T TATTrTGCAGATGATTCCAACATGGCTGTrGAAATGTrGAAAACrTTTATGTCTrT 
AGAAGTGTTCCAACAGATTGGTGAATTCTGGAAAGGTAAA^^ 

ATGG1TGGGTGAAGCCTGGAAAAGGCAGTGAGACTGTGGTATAAGGCWGTCAGCCTC 

^^^^rrA^^K3G^^GCCAGTTAGAGGCTTTCAAAGCTGTGAAAAATCAAA 
AAGTCCAGGGCGATACTGrnTCCCCTGCA(3GGT^ 
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TTTTrTTrATTTTATGTTAATGACTCAAAAAAGGTATTTTTATAG 
rjCATGCTC 

^^^^^^^^^[TACTAAGTTGACTTACCTGACGGACCCCAGAGACCTA^C^AT 
TGAGCAGTCG^ 

TGGAAGAGCTGAAATGTTAAGTITCTTGGCA^ 

AATTTTTGTGTGTCAATTACAATTAAAAG^ 

AACTGACTTTAAAACTATGGTCAACT^ 

A A ATTTTTTTATTTTTTAAAGCATAAGAATGTTATCAGAATCTGGTCTACT^ 

Tr t a A TTGTTTrCTAGTAATTGGCAAAGGCTCCTTGTAAGATTTC ACTGGAGGCAGTGT 

TrTGACATCAAGATGCATGTAAAGTCQATrQTATGTTTTGAAGGCAAAGTCITGGCITT 

tgIgI^^a^gcsgcacaggtggcccctgctgctgixkccagt^ 

™SA^CTAGGTCAG<3CTCCAGGA<3CATGAGAAnG^ 
AAGAGCAC1T*3G<3TX3AATCTGGGCACCTGA^ 

Ga=^=^^^s 

CTGCTGAACITTACTGTTCCTCTG^ 

TT^r^GT^CCATTCCATAAATCTCTTCCTGTGTGAGCCACCACACCCAGCCTGGGTCT 

CTcnnocAAOCioCTiTcra^ 

GGGCAAGGGAACATXKKK3GTAGA<H<3<3TGCrGG<X 
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TGAACTGGCAACTTACITGC^^ 
OTg£<^^ 

AATAGACTCAGGGATTTCACCAGGTCGGTGCAGTATIT 
AGCTGCAGTTAGGAAGGGAGCCATTGAGCACAGACm 

TTCTCCTGCCACAGCeTCCTGAGGAGCTGGGACTACAGGTGCGTGCTACCACGCCCAG 
CTCCTGACCTCATGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAAGTCTGA 

ScaSacacctggcctggaaggaacctot 

CTGTGATGGAGGACACTGGAGAGAGTTGCTATTCCAGTCAATCAIXjTC 
ACTCTGAAAATCCTATTGGTTCCTTTATTTTATTTGAGTTTAGAGTTGCCTTCT^ 

GTA^ATGTCTGGCAAATGACCTGG^ 

CCTGGAAACTCCTTAGAGAGCATTTTGCTC 

ATAATAGATITCATITCACrCTAGCCTACATAGAGCmCTOT 

CACTTGTGCGGTGATTACACACTTGACAGTACCAGGAGACAAATGAC^ 

CC^GACATGCCTCTTCCCCTTGGCAAGCTCAGTTGCCCTGATAGTAGCATGTTTCTGTTTC 

TTTGTAGAGGACCTTTTTGGGGTCCTATATGAGCCATGTC 

otgctctcctacaatat^ 

AGGGGCAAGAGGCACTCTGCGCCACAAAGGTTGGATCCATCITCTCrCCGAGGTTCTG 

a^g^aaattgtactaa^^ 

GGTAAGCTGCTTTCTAGATCTCrCCCAGTGAGGCATGGAGGTGm^ 

ACCTCACAGGGATGTTGTGAGGCTTGAAAAGGTCAAAAAA^ 

TTTGTAAGAAAGGTAGATGAAATATCGGATGTAATCTGAAAAAAAGATAA^ 

TCGCCCTGCTCTGTGCAGCAGTCCSGGCT^ 

CCACCCCACAGCTCCAGGAACOTGAAGCCAATCTGGGGACTTTCAGAT^ 

AGAGGTACCAGGCAAACTTCCTGCTACACATGCC^ 
GGAAATGGACCCTGCTITrAAGGATGTAGAAAAGTATGTCTGCATCGA^ 

TAAATTTCTAATTTATCACTGTACAAAGAAAACCCCTTGCTATTTAATTTT 
GAAAATAAAGTTTTGTTTGTTAAAAAAAAA 



Figure 64 

MAWRGAGPSWGAPGGVGI^LGLLLQLLLLLGPARGFGDEEERRO)Pm^CQ^GW 

TKMPNLVGHELQTDAELQL^ 
SVKFJRCEPVLKEFGFAWPESLNCSKFPPQrTO^ 

VGTNSDQYIWVKRSLNCVLKCGYDAGLYSRSAKEFTDIWMAVWASLCFISTA^ 

SSPJSYPERPIIFI^MCYNIYSIAYIVPXTVGP^PJSCDFEEAAEPVLIQEGOOT^ 

YFFGMASSIWWVILTLTWFLAAGLKWGHEAIEMHSSYFHIAAW 

DELTGLCYVGNQ>nJ)ALTGFWAPIJTYLVIGTLFlAAGLVALF 

RLMVKIGWSVLYTWATCVIACYFYEISNWA1F11YSAD 

SGMWIWSAKSLHTWQKCSNPJ.VNSGKVKEEKRGNGWVKPGKGSETVV 
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Figure 65 



ACCCAGGGACGGAGGACCCAGGCTGGGTTGGGGACTGTCT^ 

CGTGGAGAGTCCTTTCCCTGGAATCCGAGCCCTAACCGTCTCTCCCCAGGCCTATCCGG 

CGAGGAGCGGAGCGCTGCCAGCGGAGGGAGCGCCTrCCCGAAGCAGTTTATCTTTGGA 

CGGTTTTCTTTAAAGGAAAAACGAACCAACAGGTTGCCAGCCGCGGCGCCACACACGA 

GACGCCGGAGGGAGAAGCCCCGGCCCGGATTCCTCTGCCTGTGTGCGTCCCTCGCGGG 

CTGCTGGAGGCGAGGGGAGGGAGGGGGCGATGGCTCGGCCTGACCCATCCGCGCCGC 

CCTCGCTGTTGCTGCTGCTCCTGGCGCAGCTGGTGGGCCGGGCGGCCGCCGCGTGCAA 

GGCCCCGGTGTGCCAGGAAATCACGGTGCCCATGTGCCGCGGGATCGGGTACAACCTG 

ACGCACATGCCCAACCAGTTCAACCACGACACGCAGGACGAGGCGGGCCTGGAGGTG 

CACCAGTTCTGGCCGCTGGTGGAGATCCAATGCTCGCCGGACCTGCGCTTGTTCCTATG 

CACTATGTACACGCCCATGTGTCTGCCCGACTACCACAAGCCGCTGCCGCCCTGCCGCT 

CGGTGTGCGAGCGCGCCAAGGCCGGCTGCTCGCCGCTGATGCGCCAGTACGGCTTCGC 

CTGGCCCGAGCGCATGAGCTGCGACCGCCTCCCGGTGCTGGGCCGCGACGCCGAGGTC 

CTCTGCATOGATTACAACCGCAGCGAGGCCACCACGGCGCCCCCCAGGCCTTTCCCAG 

CCAAGCCCACCCTTCCAGGCCCGCCAGGGGCGCCGGCCTCGGGGGGCGAATGCCCCGC 

TGGGGGCCCGTTCGTGTGCAAGTGTCGCGAGCCCTTCGTGCCCATTCTGAAGGAGTCAC 

ACCCGCTCTACAACAAGGTGCGGACGGGCCAGGTGCCCAACTGGGCGGTACCCTGCTA 

CCAGCCGTCCTTCAGTGCCGACGAGCGCACGTTCGCCACCTTCTGGATAGGCCTGTGGT 

CGGTGCTGTGCTTCATCTCCACGTCCACCACAGTGGCCACCTTCCTCATCGACATGGAC 

ACGTTCCGCTATCCTGAGCGCCCCATCATCTTCCTGTCAGCCTGCTACCTGTGCGTGTC 

GCTGGGCTTCCTGGTGCGTCTGGTCGTGGGCCATGCCAGCGTGGCCTGCAGCCGCGAG 

CACAACCAGATCCACTACGAGACCACGGGCCCTGCACTGTGCACCATCGTCTTCCTCCT 

GGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCTGGTT 

CCTGGCCGCCGCGATGAAGTGGGGCAACGAGGCCATCGCGGGCTACGGCCAGTACTTC 

CACCTGGCTGCGTGGCTCATCCCCAGCGTCAAGTCCATCACGGCACTGGCGCTGAGCTC 

CGTGGACGGGGACCCAGTGGCCGGCATCTGCTACGTGGGGAACCAGAACCTGAACTCG 

CTGCGGCGCTTCGTGCTGGGCCCGCTGGTGCTCTACCTGCTGGTGGGCACGCTCTTCCT 

GCTGGCGGGCTTCGTGTCGCTCTTCCGCATCCGCAGGGTCATCAAGCAGGGCGGCACC 

AAGACGGACAAGCTGGAGAAGCTCATGATCCGCATCGGCATCTTCACGCTGCTCTACA 

CGGTCCCCGCCAGCATTGTGGTGGCCTGCTACCTGTACGAGCAGCACTACCGCGAGAG 

CTGGGAGGCGGCGCTCACCTGCGCCTGCCCGGGCCACGAGACCGGCCAGCCGCGCGCC 

AAGCCCGAGTACTGGGTGGTCATGCTCAAGTACTTCATGTGCCTGGTGGTGGGCATCAC 

GTCGGGCGTCTGGATCTGGTCGGGCAAGACGGTGGAGTCGTGGCGGCGTTTCACCAGC 

CGCTGCTGCTGCCGCCCGCGGCGCGGCCACAAGAGCGGGGGCGCCATGGCCGCAGGG 

GACTACCCCGAGGCGAGCGCCGCGCTCACAGGCAGGACCGGGCCGCCGGGCCCCGCC 

GCCACCTACCACAAGCAGGTGTCCCTGTCGCACGTGTAGGAGGCTGCCGCCGAGGGAC 

TCGGCCGGAGAGCTGAGGGGAGGGGGGCGTTTTGTTTGK3TAGTTTTGCCAAGGTCACT 

TCCGTTTACCTTCATGGTGCTGTTGCCCCCTCCCGCGGCGACTTGGAGAGAGGGAAGAG 

GGGCGTTTTCGAGGAAGAACCTGTCCCAGGTCTTCTCCAAGGGGCCCAGCTCACGTGT 

ATTCTATTTTGCGTTTCTTACCTGCCTTCTTTATGGGAACCCTCTTTTTAAm 

T 



Figure 66 



WO 02/077204 



47/66 



PCT/GB02/01195 



MARPDPSAPPSIXLI1IAQLVGRAAAASKAPVCQEITWM 
DEAGLEVHQFWPLVEIQCSPDLRFFLCTMYTPICLPDYHKZPLPPCRSVCERAELA^ 

CPAGGPWCKCP^PFVPEXESHPLYNKVRTGQWNCAWCYQPSFSADERT^ 

YETTGPALCTIWLLVYFFGMASSIWWV^ 
IPSVl^ITAIAIisSVDGDPVAGICYVGNQN^ 
RSVIKQGGTKTDKEEKLMIRIGIFTLLYTWASIVVAC^ 

TGQPRAKPEYWVIMIXYIMCLW 
AMAAGDYPEASAALTGRTGPPGPAATYHKQVSLSHV 



Figure 67 

GCAGCTCCAGTCCCGGACGCAACCCCGGAGCCGTCTCAGGTCCCTGGGGGGAACGGTG 
GGTTAGACGGGGACGGGAAGGGACAGCGGCCTTCGACCGCCCCCCGAGTAATTGAGCC 
AGGACTCATTTTCAGGAAAGCCTGAAAATGAGTAAAATAGTGAAATGAGGAATTT 

CATTTTATCTTTGGATGGGGATCTTCTGAGGATGC 

TGGTAAAATCAGGAATTTGAAGAAAATGGAGATGTITACATTTTTGTTGA 

TTCTACCCCTCCTAAGAGGGCACAGTCTCTTCACCTGTGAAC^ 

TGTATGAAAATGGCCTACAACATGACGTITrTCCCTAATCrGATGGGTCA^AIGA 

GAGTATTGCCGCGGTGGAAATGGAGCATTTTCTTCCTCTCGCAAATCTGGAATGTOAC 

CAAACATTGAAACTTTCCTCTGCAAAGCATTTGTACCAACCTGCATAGAACAAA^CAT 

GTGGTTCCACCTTGTCGTAAACTTTGTGAGAAAGTATATTCTGATTGCAAAAAATTAAT 

TGACACTTTTGGGATCCGATGGCCTGAGGAGCTTGAATGTGACAGATTACAATAGTCT 

ATGAGACTG1TCCTGTAACTTTTGATCCACACACAGAATTTCTTGGTCCTCAGAAGAAA 

ACAGAACAAGTCCAAAGAGACATTGGATTTTGGTGTCCAAGGCATCTTAAGAOT 

GGGGACAAGGATATAAGTTTCTGGGAATTGACCAGTGTGCGCCTCCATGCCC^ 

GTATTTTAAAAGTGATGAGCTAGAGTTTGCAAAAAGTTTTATTGGAACAGTTTC 

TTTGTCTTTGTGCAACTCTGTTCACATTCC 

GATACCCAGAGAGACCAATTATATATTACTCTGTCTGTTACAGCATTGTATCTCTTATG 
TACTTCATTGGATrTTTGCTGGGCGATAGCACAGCCTGCAATAAGGCAGATGAGAAGC 
TAGAACITGGTGACACTGTTGTCCTAGGCTCTCAAAATAAGGCTTGCACCGTTTTGTTC 

ATGCTTTTGTATTTTTTCACAATGGCTGGCACTGTGTGGTG 

TGGTTCTTAGCTGCAGGAAGAAAATGGAGTTGTGAAGCCATCGAGCAAAAAGCAGTGT 

GGTTTCATGCrGTTGCATGGGGAACACCAGGTTTCCTGACTGTTATGCTTCTTGCTCTGA 

ACAAAGTTGAAGGAGACAACATTAGTGGAGTTTGCTTTGTTGGCCTTTATGACCTGGAT 

GCTTCTCGCTACTTTGTACTCTTGCCACTGTGCCTTTGTGTGTTTGTTGGGCTCTCTCTTC 

TTITAGCTGGCArTATTTCCTTAAATCATGTTCGACAAGTCATACAA.CATGATGGCCGG 

AACCAAGAAAAACTAAAGAAATTTATGATTCGAATTGGA 

TTGTGCGATTAGTGACACTTCTCGGATGTTAGGTCTATGAGCAAGTGAACAGGATTACC 
TGGGAGATAACTTGGGTGTCTGATCATTGTCGTCAGTACCATATCCCATGTCCTTATCA 
GGCAAAAGCAAAAGCTCGACCAGAATTGGCTTTATTTATGATAAAATACCTGATGACA 
TTAATTGTTGGCATCTCTGCTGTCTTCTGGGTTGGAAGCAAAAAGACATGCACAGAATG 

GGCTGGGTTTTTTAAACGAAATCGCAAGAGAGAT 

CTACAGGAATCATGTGAGTTTTTCTTAAAGCACAATTCTAAAGTTAAACACAAAAAG^A 
AGCACTATAAACCAAGTTCACACAAGCTGAAGGTCATTTCCAAATCCATGGGAA 

CACAGGAGCTACAGCAAATCATGGCACTrCTGCAGTAGCAATT^ 
CTAGGACAAGAAACTTTGACAGAAATCCAAACCTCACCAGAAACATCAATG^ 
GTGAAAGCGGACGGAGCTAGCACCCCCAGGTTAAGAGAACAGGACTGTGGTGAACC^ 
GCCTCGCCAGCAGCATCCATCTCCAGACTCTCTGGGGAACAGGTCGACGGGAAGGGCC 
. . ~. — . ~r~^ , rr,^^ a a k nTn^p-HA^THA a^.a A aa A TT A GTC.C AAAGAGTG 
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ATATTACTGACACTGGCCTGGCACAGAG 
rrAAGCAGC^^ 

aga^aSg^ 

AGCAAATTTGTGTTACACTGGAAGTG^ 

GTTCTTCTTTTGCACTTAAAGTTGCATTGCCTACTGTTATACTGGAA^ 

AAATGTGCAGGTTAATAATATTTTTTTAATAGTGTGGGA^ 
TTCCTTTTCTATTTATGAAGATTCTACTCTrGGTAAGAGTA 
ATTTTACCTTTTTGATATAAAATCAAGATATTTCTTTGCT 

ttgStcttttStacatat 

GAATTTCTAAGAAAATTGTAAAATAGTOT^ 

AATGTGTGATTTTTATAGTCTCGTTTTAGGAATTTCACAGATCTAA^ 

AATAAGGTGCTTACTCAAAGAGTGTCCACTATTGATTGTAT^ 

TTCTGCATATTTAAAATAAAATGTrCCTAAAGGGTTAGTAGACAAA^ 

TATATTAGGCCAAGTGCAATTGACTrCCCT^ 
GTA^TA^ 

GAATATTACATTTTGTATTATACAGTACCTITCTCAGACATTTTGTAG 



Figure 68 

IVffiMFTFIXTCIFLPLLRGHSLFTCEPnTPRCMKMAYmiTFFPNLMGHTO 
SL^ECSPNffiTFLCKAFWTCIEQIHVWPCRKLCEKVYSD 

PNMYFK^ELEFAKSFIGTVSFCLCATIFITLTFLroVRP^ 

G^LG^TAC^EKLELGDTVVLGSQNKACTVLF^ 

AGRKWSCEAIEQKAVWFHAVAWGTPGFLTVMLLALNKVEGDNESGV 

fVlh>lclcvf^^ 

GCYVYEQVNRITWEITWVSDHCRQYHIPCPYQAK 

VFWGSKXTCTEWAGFFKIWPJ^^ 

T K^SK^MGTSTGATANEGTS AVAITSHDY^ 

q^cg^^ 

epsslkgstsllvhpvsgvrkeqgggchsdt 



Figure 69 

====== 

ACTCCGCTTTTTCTTATGCTCCATGTAT(^ 
CCCGCCGTGTCGTTCTCTGT^ 

AAGTTCGGCTTCCAGTGGCCCGAGCGGCTGCGCTGCGAGAACTTCCCGGTGCACGGTG 

CGGGCGAGATCTGCGTGGGCCAGAACACGTCGG^ 
GCCCCACTGCCTACCCTACC^^^^^ 
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CAA(KJ1XKXXXX»TACCTKKKOT 

GCGAACCGGGCCGTGCCAACGGCCTGATGTACTTTAAGGAGGAGGAGAGGCGOTCGC 

CCGCCTCTGGGTGGGCGTGTGGTCCGTGCTGTGCTGCGCCTCGACGCTCTTTACCGTTG 

TCACCTACCTGGTGGACATGCGGCGCTTCAGCTACCCAGAGCGGCCCATCATCCTCCTG 

TCGGGCTGCTACTTCATGGTGGCCGTGGCGCACGTGGCCGGCrrCCTTCTAGAGGACCG 

CGCCGTGTGCGTGGAGCGCTTCTCGGACGATGGCTACCGCACGGTGGCGCAGGGCACC 

AAGAAGGAGGGCTGCACCATCCTCTTCATGGTGCTCTACTTCTTCGGCATGGCCAGCTC 

CATCTGGTGGGTCATTCTGTCTCTCACTTGGTTCCTGGCGGCCGGCATGAAGTGGGGCC 

ACGAGGCCATCGAGGCCAACTCGCAGTACTTCCACCTGGCCGCGTGGGCCGTGCCCGC 

CGXCAAGACCATCACTATCCTGGCCATGGGCCAGGTAGACGGGGACCTGCTGAGCGGG 

GTGTGCTACGTTGGCCTCTCCAGTGTGGACGCGCTGCGGGGCTTCGTGCTGGCGCGTCT 

GTTCGTCTACCTCTTCATAGGCACGTCCTTCTTGCTGGCCGGCTTCGTGTCCCTCTTCCG 

TATCCGCACCATCATGAAACACGACGGCACCAAGACCGAGAAGCTGGAGAAGCTCAT 

GGTGCGCATCGGCGTCTTCAGCGTGCTCTACACAGTGCCCGCCACCATCGTCCTGGCCT 

GCTACTTCTACGAGCAGGCCTTCCGCGAGCACTGGGAGCGCACCTGGCTCCTGCAGAC 

GTGCAAGAGCTATGCCGTGCCCTGCCCGCCCGGCCACTTCCCGCCCATGAGCCCCGACT 

TCACCGTCTTCATGATCAAGTACCTGATGACCATGATGGTCGGCATCACCACTGGCTTC 

TGGATCTGGTCGGGCAAGACCCTGCAGTCGTGGCGCCGCTTCTACCACAGACTTAGCC 

ACAGCAGCAAGGGGGAGACTGCGGTATGAGCCCCGGCCCCTCCCCACCTTTCCCACCC 

CAGCCCTCTTGCAAGAGGAGAGGCACGGTAGGGAAAAGAACTGCTGGGTGGGGGCCT 

GTTTCTGTAACTTTCTCCCCCTCTACTGAGAAGTGACCTGGAAGTGAGAAGTTCTTTGC 

AGATTTGGGGCGAGGGGTGATTTGGAAAAGAAGACCTGGGTGGAAAGCGGTTTGGAT 

GAAAAGATTrCAGGCAAAGACTTGCAGGAAGATGATGATAACGGCGATGTGAATCGTC 

AAAGGTACGGGCCAGCTTGTGCCTAATAGAAGGTTGAGACCAGCAGAGACTGCTGTGA 

GTTTCTCCCGGCTCCGAGGCTGAACGGGGACTGTGAGCGATCCCCCTGCTGCAGGGCG 

AGTGGCCTGTCCAGACCCCTGTGAGGCCCCGGGAAAGGTACAGCCCTGTCTGCGGTGG 

CTGCTTTGTTGGAAAGAGGGAGGGCCTCCTGCGGTGTGCTTGTCAAGC AGTGGTCAAA 

CCATAATCTCTTTTCACTGGGGCCAAACTGGAGCCCAGATGGGTTAATTTCCAGGGTCA 

GACATTACGGTCTCTCCTCCCCTGCCCCCTCCCGCCTGTTTTTCCTCCCGTACTGCTTTC 

AGGTCTTGTAAAATAAGCATTTGGAAGTCTTGGGAGGCCTGCCTGCTAGAATCCTAATG 

TGAGGATGCAAAAGAAATGATGATAACATTTTGAGATAAGGCCAAGGAGACGTGGAG 

TAGGTATrrTTGCTACTTTTTCATTITCTGGGGAAGGCAGGAGGCAGAAAGA 

TTrATTrGGTCTAATACCCTGAAAAGAAGTGATGACTTGTTGCTTTTCAAAACAGG 

GCATTTTTCCCeTTGTCTITGTTGTAAGAGACAAAAGAGGAAACAAAAGTGTCTCCCTG 

TGGAAAGGCATAACTGTGACGAAAGCAACTTTTATAGGCAAAGCAGCGCAAATCTGAG 

GTTTCCCGTTGGTTGTTAATTTGGTTGAGATAAACATTCCTTTTTAAGGAA 

AGCAGTGTGCTGTCACACACCGTTAAGCCAGAGGTTCTGACTTCGCTAAAGGAAATGT 

AAGAGGTTTTGTTGTCTGTTTTAAATAAATTTAATTCGGAACACATGATCCAACAGACT 

ATGTTAAAATATTCAGGGAAATCTCTCCCTTCATTTACTTTTTCTTGCTATAAGCCTATA 

TTTAGGTTTCTTTTCTATTTTT^ 

GTCTTCAGCCTCATAATAAAGGAAAGTTAATTAAAAAAAAAAAGCAAAGAGCCATTTT 

GTCCrGTTTTCTTGGTTCCATCAATCTGTTTATTAAACATCATCCATATGCTGACCCTGT 

CTCTGTGTGGTTGGGTTGGGAGGCGATCAGCAGATACCATAGTGAACGAAGAGGAAGG 

TTTGAACCATGGGCCCCATCTTTAAAGAAAGTCATTAAAAGAAGGTAAACTTCAAAGT 

GATTCTGGAGTTCTTTGAAATGTGCTGGAAGACTTAAATTTATTAATCTTAAATCATGT 

ACTTTTTTTCTGTAATAGAACTCGGATTCTTTTGCATGATGGGGTAAAGCT^ 

AATCATGGGAGCTAACCTTTATCCCACCmGACACTACCCTCCAATCTTGCAACACTA 

TCCTGTTTCTCAGAACAGTTTTTAAATGCCAATCATAGAGGGTACTGTAAAGTGTACAA 

GTTACTTTATATATGTAATGTTCACTTGAGTGGAACTGCTTTTTACATTAAAGT^ 

CGATCTrGTGTTTCTrCAACCTTCAAAACTATCTCATCTGTCAGATTTrrAAAACTCCAA 

CACAGGTTTTGGCATCTTTTGTGCTGTATCTTTTAAGTGCATGTGAAATTTGTAA^^ 
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TATTTATACATTTTTACTTTGGATTTTTGTTCT 
TCACATGTACAGATCACAAATAAATTTTTTTAAATAC 

Figure 70 

MRDPGAAAPI^SLGLCALVIALLGAI^AGAGAQPYHGEKGISWDHGFCQPISIPirTDlAY 

NQmPI^LGHTNQEDAGLEVHQFYPL^^ 
ERARQGCEAIJVINKFGFQWPEPJJ^^ 

LPDLPFT ALPPGASDGRGPvPAFPFS CPRQIJKVPPYLGYRFLGERDCGAPCEPGRANGLMYF 
K£EEFJtfAPXWGWSVLCCASTH?TVL^ 

GFLF EDRAVCVERFSDDGYRTV AQGTKKEGCTD^FMVLYFFGMASSIWWVILSLTWFLAA 
GMKWGHE \IEANSQYFHLAAWAWAVKTTTILAMGQVDGDLLSGVCYVGLSSVDA 
LRGFVLAPLFVTLFIGTSFLLAGF^SLFRIRTIMK 
IVLACYFYEQAFREHWERTWLLQTCKSYAWCPPG^ 
GFWIWSGKTLQSWRRFYHPJ^HSSKGETAV 

Figuxe71 

ACAGCATGGAGTGGGGTTACCTGTTGGAAGTGACCTCGCTGCTGGCCGCCTTGGCGCT 

GCTGCAGCGCTCTAGCGGCGCTGCGGCCGGCTCGGCCAAGGAGCTGGCATGCCAAGAG 

ATCACCGTGCCGCTGTGTAAGGGCATCGGCTACAACTACACCTACATGCCCAATCAGTT 

CAACCACGACACGCAAGACGAGGCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTG 

GAGATCCAGTGCTCGCCCGATCTCAAGTTCTTCCTGTGCAGCATGTACACGCCCATCTG 

cctagAggactacaagaagccgctgccgccctgccgctcggtgtgcgagcgcgccaag 

GCCGGCTGCGCGCCGCTCATGCGCCAGTACGGCTTCGCCTGGCCCGACCGCATGCGCT 

GCGACCGGCTGCCCGAGCAAGGCAACCCTGACACGCTGTGCATGGACTACAACCGCAC 

CGACCTAACCACGGCCGCGCCCAGCGCGCCGCGCCGCCTGCCGCCGCCGCCGCCCGGC 

GAGCAGCCGCC1TCGGGCAGCGGCCACGGCCGCCCGCCGGGGGCCAGGCCCCCGCACC 

GCGGAGGCGGCAGGGGCGGTGGCGGCGGGGACGCGGCGGCGCCCCCAGCTCGCGGCG 

GCGGCGGTGGCGGGAAGGCGCGGCCCCCTGGCGGCGGCGCGGCTCCCTGCGAGCCCG 

GGTGCCAGTGCGGCGCGCCTATGGTGAGCGTGTCCAGCGAGCGCCACCCGCTCTACAA 

CCGGGTCAAGACAGGCCAGATCGCTAACTGCGCGCTGCCCTGCCACAACCCCTTTTTCA 

GCCAGGACGAGCGCGCCITCACCGTCTrCTGGATCGGCCTGTGGTCGGTGCTCTGCTTC 

CTGTCCACCTTCGCCACCGTCTCCACCTrCCITATCGACATGGAGCGCTTCAAGTACCC 

GGAGCGGCCCATTATCTTCCTCTCGGCCTGCTACCTCTTCGTGTCGGTGGGCTACCTAG 

TGCGCCTGGTGGCGGGCCACGAGAAGGTGGCGTGCAGCGGTGGCGCGCCGGGCGCGG 

GGGGCGCTGGGGGCGCGGGCGGCGCGGCGGCGGGCGCGGGCGCGGCGGGCGCGGGCG 

CGGGCGGCCCGGGCGGGCGCGGCGAGTACGAGGAGCTGGGCGCGGTGGAGCAGCACG 

TGCGCrACGAGACCACCGGCCCCGCGCTGTGCACCGTGGTCTTCTTGCTGGTCTACTTC 

TTCG<5CATGGCCAGCTCCATCTGGTGGGTGATCTTGTCGCTCACATCKjTTCCTGGCC}GC 

CGGTATGAAGTGGGGCAACGAAGCCATCGCCGGCTACTCGCAGTACTTCCACCTGGCC 

gcgtggcttgtgcccagcgtcaagtccatcgcggtgctggcgctcagctcggtggacg 

GCGACCCGGTGGCGGGCATCTGCTACGTGGGCAACCAGAGCCTGGACAACCTGCGCGG 

cttcgtgctggcgccgctggtcatctacctcttcatcggcaccatgttcctgctggccg 

gctrcgtgtccctgttccgcatccgctcggtcatcaagcaacaggacggccccaccaag 

acgcacaagctggagaagctgatgatccgcctgggcctgttcaccgtgctctacaccg 

TGCCCGCCGGGGTGGTGGTCGCCTGCCTCTTCTACGAGCAGCACAACCGCCCGCGCTG 
GGAGGCCACGCACAACTGCCCGTGCCTGCGGGACCTGCAGCCCGACCAGGCACGCAG 
GCCCGACTACGCCGTCTTCATGCTCAAGTACTTCATGTGCCTAGTGGTGGGCATCACCT 
CGGGCGTGTGGGTCTGGTCCGGGAAGACGCTGGAGTCCTGGCGCTCCCTGTGCACCCG 
CTGCTGCTGGGCCAGCAAGGGCGCCGCGGTGGGCGGGGGCGCGGGCGCCACGGCCGC 



■» ^ *-i /*> /-» /~* /~\ f 



WO 02/077204 



51/66 



PCT/GB02/01195 



GGCCGGGCGGCGGCGGCKjGCTCCCTCTACAGOT 

GTCGGGCACGGCGAGCTCCGTGTCTrATCCAAAGCAGATGCCATrGTC(XAwl^l^^ 



GCGGAGGGGAGGGGGCCKX)CAGGAGGGGTGGGGA<^GGGGCG 

tt A A ATTnTAAAAAAAAAAAGTGTAAjVATATGTATATATCCAAAGATATAGTQjUjl AL, 

TTGTCAATAAAATGACTTTTGATAAATGATTTAACCATTGCC 
GA^CTGTCACCTTTAAAG^^ 

CTTGTCArrCTGTACACTGACCTTAGGCATGGAGAA^ 
TTAAGTTGTTAGCCAAGTAAATATCATTGTTGAACPGAAATC^ 

CCTTCCCCAAAGACGGTGTTTTTCATGGGAGCTC^^ 

TCACTITAGTGGATGTAAATGGAACITCTGCAAGGCAOTAATTCC 

ATTTATCCTGK3ATGGTATCACTAAAGGTTTCAAAACCCTGAAAAAAAA 



Figure 72 

imrrYT t pvTSLLAALALLOP^SGAAAAJJAK^^ 

aagagaggpggrgeyeelgaveqimyettgpalct^ 
wlaagmkwgneaiagysqyfhl^^ 



v^sgktles^lc^ 

GGGGGSLYSDVSTGLTWRSGTASSVSYPKQMPLSQV 
Figure 73 



CCGCCTTCGGCCCGGGCCTO 



CCGGGATGGCCGTGGCGCCTCTGCGGGGGGCGCTGCTG 



CGCGGGGCTG 
^CCTGACCCG< 
3CTAGCGGAG 

AGTTCAACTTCGGCTGGCC! 
CCCGCACGCGCTGTGCATGG. 



AGCOCGGGCXXXX^^ 
.CC 

.dcmaiccOTSSSSSSS^^ 



GGACTCGCTCGACTGCGCCCGGCTGCCCACGCGCAACGA 
AGGCGCCCGAGAACGCCACGGCCGGCCCCGCGGAGCCC 



naAOAnr. 
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TGGGCCCGGGCGCGC^CC^AGTGGCACCTGCG^ 

GGAGAAGA(}CCGCTCGTGC<3CACCGCGCTG^ 

CGGCGCGACAAGGACTTCGCGCTGGTCTGGAT^ 

GATCCAGGA(3GGCCrrGGAGAACAC(3GGCTGCACGCTGGTOT^ 
rrCGGCATG<3CCAGCTCGCTCT^ 

SaSTg^^^ 

GCCACCTGCGTCATCGTTTGCTATGTCTACGA^ 

TCGGGCCACAGAGCAGCCATGCGCAGCGGCCGCGGGGCQCG^ 

CTCGCTGCCAGGGGGCTCGGTGCCCACCGTGGCGGTCTTCArcCTC^ 

CACrGGTGGTGGGGATCACCAGCGGKJGTCTG 
CTG^GAGCCTGTC^ACCGCAAGATAGCAGCT^ 
CGCGCCCCCGK3GAGCTACGGACGTGGCACGCACTGCCA_CTATAAGG^ 

3ACG 

CCCCTAGAGACACXTCACTAGCAGCTGCCC^ 



TCTIWACATGACTAAGACGGACCCCTC^^ 

TGGGAAGGGGGAAGGTAGGAGGTGAGGC 
Figure 74 

MAVAPIJlGAIXLWQLLAAGGAALEIGRFDPERGRGAAPCQAVEff 

^BFAPLVQYGCHSH ^ „ 7 1 n _ _ , D 

CAPMQFOTGWPDSIJX^A^^ 



isASsd==s^^s=2 

EQrCAAAAGPGGPJRDCSUPGGSVPTVANO^^ 
YRIQAAGRARAKACRAPGSYGRGTHCHYKAPTVV1HMTKTDPSLENPTHL 



Figure 75 

ACAamX^OGCCAGCATOCAG^ 

GATGGGCTCGTGCGCCGCCATCAGGTCCATGGACATG^GAGCGCCCGGGCG^ 
TGCCAGCCCATCGAGATCCCGATGTGCAAGGACATCGGCTAG 

ccaacctgatgggccacgagaacca<x:gcgaggca^^ 

GCCGCTGGTGGAGTACGGCTGCCACGGCCACCTCC^^ 
CGCCGATGTGCACCGAGCAGGTCTCTACCCCCATCCCCGCCTGCCGGGT^ 

~^^^~-n^» a /-"r/^«~"f /-/"Trr} a tt A TfifrA GC. A GTTCAACTTCAAGTGGCUUijAU 
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TCCCTGGACTGCCGGAAACTCCCCAACAAGAACGACCCCAACTACCTGTGCATGGAGG 

CGCCCAACAACGGGTCGGACGAGCCCACGCGGGGCTCGGGCCTGTTCCCGCCGCTGTT 

CCGGCCGCAGCGGCCCCACAGCGCGCAGGAGCACCCGCTGAAGGACGGGGGCCCCGG 

GCGCGGCGGCTGCGACAACCCGGGCAAGTTCCACCACGTGGAGAAGAGCGCGTCGTG 

CGCGCCGCTCTGCACGCCCGGCGTGGACGTGTACTGGAGCCGCGAGGACAAGCGCTTC 

GCAGTGGTCTGGCTGGCCATCTGG^CK5TGCTGTGCTrCTTCTCCAGCGCCTTCACCGT 

GCTCACCTTCCTCATCGACCCGGCCCGCTTCCGCTAGCCCGAGCGCCCCATCATCTTCC 

TCTCCATGTGCTACTGCGTCTACTCCGTGGGCTACCTCATCCGCCTCTTCGCCGGCGCC 

GAGAGCATCGCCTGCGACGGGGACAGCGGCCAGCTCTATGTCATCCAGGAGGGACTGG 

AGAGCACCGGCTGGACGCTGGTCTTCCTGGTCCTCTACTACTTCGGCATGGCCAGCTCG 

CTGTGGTGGGTGKjTCCTCACGCTCACCTGGTTCCTGGCCGCCGGCAAGAAGTGGGGCC 

ACGAGGCCATCGAAGCCAACAGCAGGTACTTCCACCTGGCAGCCTGGGCCATCCCGGC 

ggtgaagaccatcctgatcctggtcatggg<:agggtggcgggggacgagctcaccggg 

GTCTGCTACGTGGGCAGCATGGACGTCAACGCGCTCACCGGCTTCGTGCTCATTCCCCT 

GGCCTGCTACCTGGTCATCGGCACGTCCTTCATCCTCTCGGGCTTCGTGGCCCTGTTCC 

ACATCCGGAGGGTGATGAAGACGGGCGGCGAGAACACGGACAAGCTGGAGAAGCTCA 

TGGTGGGTATCGGGCTCTTCTCTGTGCTGTACACCGTGCCGGCCACCTGTGTGATCGGC 

TGGTACTITrACGAACGCCTCAACATGGATTACTGGAAGATCCTGGCGGCGCAGCACA 

AGTGCAAAATGAACAACCAGACTAAAACGCTGGACTGCCTGATGGCCGCCTCCATCCC 

CGCCGTGGAGATCTTCATGGTGAAGATCTTTATGCTGCTGGTGGTGGGGATCACCAGGG 

GGATGTGGATTTGGACCTCCAAGACTCTGCAGTCCTGGCAGCAGGTGTGCAGCCGTAG 

GTTAAAGAAGAAGAGCCGGAGAAAACCGGCCAGCGTGATCACCAGCGGTGGGATTTA 

CAAAAAAGCCCAGCATCCCCAGAAAACTCACCACGGGAAATATGAGATCCCTGCCCAG 

TCGCCCACCTGCGTGTGAACAGGGCTG<jAGGGAAGGGCACAGGGGCGGCCGGAGCTA 

AGATGTGGTGCTTTTCTTGGTTGTGTTTTTCTTTGITCTTCTTCm^ 
AAGGAAAAGAGAAATACATAAAAAAGTGTTTACCGTGAAATTCAGGATGCTGTGATAC 

ACTGAAAGGAAAAATGTACTTAAAGGGTTTTGTTTTGTTTTGG 

AGCTCCTCCAGTGAAGTAGCCTCTTGTGTAACTAATTTGTGGTAAAGTAGTTGATTCAG 

CCCTCAGAAGAAAACTTTTGTTTAGAGCCCTCCGTAAATATACATCTGTGTATTTGAGT 

TGGCTTTGCTACCCATTTACAAATAAGAGGACAGATAACTGCTTTGCAAATTCAAGAGC 

CTCCCCTGGGTTAACAAATGAGCCATCCCCAGGGCCCACCCCCAGGAAGGCCACAGTG 

CTGGGCGGCATCCCTGCAGAGGAAAGACAGGACCCGGGGCCCGGCTCACACCCCAGTG 

GATTTGGAGTTGCTTAAAATAGACTCTGGCCTTCAGCAATAGTCTCTCTGCAAGACAGA 

AAGCTCCATCAAACCTCACATTTGTGAACTCAAACGATGTGCAATACATTTTTT^ 

TCCTTGAAAATAAAAAGAGAAACAAGTATTTTGCTATATATAAAGACAACAAAAGAAA 

TCTCCTAACAAAAGAACTAAGAGGCCCAGCCCTCAGAAACCCTTCAGTGCTACATTTT 

GTGGCTTTTTAATGGAAACCAAGCCAATGTTATAGACGTTTGGACTGATTTGTGGAAAG 

GAGGGGGGAAGAGGGAGAAG<jATCATTCAAAAGTTACCCAAAGGGCTTATTGACTCTT 

TCTATTGTTAAACAAATGATTTCCACAAACAGATCAGGAAGCACTAGGTTGGCAGAGA 

CACTTTGTCTAGTGTATTCTCTTCACAGTGCCAG<jAAAGAGTGKjTTTCTGCGTGTGTAT 

ATTTGTAATATATGATATTTTTCATGGTCCACTATTTTATTAAAAATAAA^ 

TAAAAAAA 

Figure 76 

MQBPGPRLWLVLQVMGSCAAISSMDMER^ 

NQPJBAAIQIJIEFAPLVEYGOTGHIJOTLCSLYAPMCTEQVSTPffACIlVMCTQARLKCSPI 
MEQFNFKWPDSLDCRKLP^ 

PLKDGGPGRGGCDr^GEIFHEIVEKSASCAPLCTPGVDVYW 
FSSAFTVLmiDPAPJ^YPFItfllFI^MCY^ 

LESTGCTLVFLVLYYFGMASSLWWVVLTLTWFIAAGKKWGHEAIEANSSYFH^ 
a wttt tt \n>/n>x>v A r?r»T?T TaVHYVOSMT) VNT A T .TGFVT .TPT , A GYTJVIGTSFILSGFVAL 
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FHE^VMKTGGENTDIQJSK^ 
CKMNNQTICTLDC^ 

EKSRRKPASVTTSGGIYKKAQHPQKTHBGKYEIPAQSPTCV 



Figure 77 

CCTOTCACSCCTCCGGAGTCAGTGCCGCGCG^ 

ACCTCCGGGAGCCGGGGCGCACCCAGCCCGCAGCGCCGCCTCCCCGCCCGCGCCGCCT 
CCGACCGCAGGCCGAGGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCGGCC 

GGGGAGCCGGGCAACGCTGGGGACTGCGCCTITTGTCCCCGGAGGTCCCT 

GCGGCAGGACGCGCGCGGGGAGGCGGCGGAGGCAGCCCCGACGTCGCGGAGAACAGG 

GCGCAGAGCCGGCATGGGCATGGGGCGCAGCGAGGGGGGCCGCCGCGGGGCCCTGGG 

CGTGCTGCTGGCGCTGGGCGCGGCGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGAC 

TACGTGAGCTTCCAGTCGGACATCGGCCGGTACCAGAGCGGGCGCTTCTACACCAAGC 

CACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAACGTGGGCTACAA 

GAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCA 

GGCCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGGACCCAGGTCTTC 

CTCTGCTCGCTCTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCT 

CTGCGAGGCCGTGCGCGACrCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGC 

CCGAGATGCTTAAGTGTGACAAGTTCCCGGAGGGGGACGTCTGCATCGCCATGACGCC 

GCCCAATGCCACCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCGTCCCTGTGAC 

AACGAGTTGAAATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGCACTGA 

GGATGAAAATAAAAGAAGTGAAAAAAGAAAATGGCGACAAGAAGATTGTCCCCAAGA 

AGAAGAAGCCCCTGAAGTTGGGGCCCATCAAGAAGAAGGACCTGAAGAAGCTTGTGC 

TGTACCTGAAGAATGGGGGTGACTGTCCCTGCCACCAGCTGGACAACCTCAGCCACCA 

CTTCCTCATCATGGGCCGCAAGGTGAAGAGCCAGTACTTGCTGACGGCCATCCACAAG 

TGGGACAAGAAAAACAAGGAGTTCAAAAACTTCATGAAGAAAATGAAAAACCATGAG 

TGCCCCACCTTTCAGTCCGTGTTTAAGTGATTCTCGCGGGGGCAGGGTGGGGAGGGAG 

CCTCGGGTGGGGTGGGAGCGGGGGGGACAGTGCCCGGGAACCCGTGGTCACACACAC 

GCACTGCCCTGTCAGTAGTGGACATTGTAATCCAGTCGGCTTGTTCTTGCAGCATTCCC 

GCTCCCTTTCCCTCCATAGCCACGCTCCAAACCCCAGGGTAGGCATGGCCGGGTAAAG 

CAAGGGCCATTTAGATTAGGAAGGTTTTTAAGATCCGCAATGTGGAGCAGCAGCCACT 

GCACAGGAGGAGGTGACAAACCATTTCCAACAGCAACACAGCCACTAAAACACAAAA 

AGGGGGATTGGGCGGAAAGTGAGAGCCAGCAGCAAAAACTACATTTTGCAACTrGTTG 

GTGTGGATCTATTGGCTGATCTATGCCTTTCAACTAGAAAATTCTAATGATTGGCAAGT 

CACGTTGTTTTCAGGTCCAGAGTAGTTTCTTTCrGTCTGCTTTAAATGGAAACAGACTC 

ATACCACACTTACAATTAAGGTCAAGCCCAGAAAGTGATAAGTGCAGGGAGGAAAAG 

TGCAAGTCCATTATCTAATAGTGACAGCAAAGGGACCAGGGGAGAGGCATTGCCTTCT 

CTGCCCACAGTCTTTCCGTGTGATTGTCTTTGAATCTGAATCAGCCAGTCTCAGATGCC 

CCAAAGTTTCGGTTCCrATGAGCCCGGGGCATGATCTGATCCCCAAGACATGTGGAGG 

GGCAGCCTGTGCCTGCCTTTGTGTCAGAAAAAGGAAACCACAGTGAGCCTGAGAGAGA 

CGGCGATTTTCGGGCTGAGAAGGCAGTAGTTTTCAAAACACATAGTTA 



Figure 78 

MGIGRSEGGRRGAALGVLLALGAAL^^^ 

PADUtf£HNVGYKKMVI^^ 

CUDRPIYPCRWLCEAVPODSCEPVMQFFGFYWPE^^ 

QGTTVCPPCDNELKSEAIEHLCASEFGl^IJ^^ 

T xrcTo^xTT uppr 4hap <3PTv/n?T TTTP.ATPT .ALGRSAPGLFADCPERPLPVCSFPH 
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HTEEVGIOJUHSELLQVKGFSMKGI^^ 
PPAERCKDEEDKAMFSK 



Figure 79 

r A ATTrGTTCAGCCTGGTTAAGTCCAAGCTGGCT 

AgWgCGGGCCCGGGACAAGCTCGAACTC^^ 

CCCTCTACCCCCTAGGGGTCGCGC^ 

rrrrrccTCGCCTCXSCAC^ 

TCCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGG^ 

AACGCnTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATA^ 
AACCGT 



Figure 80 

MLQGPGSLLLIJLASHCC^^ 

^hss^sssssssss^sssssss seeks 

DIMETIXKl^FAlJaKVKEITYINR 
Figure 81 

rrGGGTCGGAGCCCCCGGGAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCC 

GTCGCACCCAGCGAAGAGAGCGGGCCCGGGACAAGCTCG^CTrc 
TTrrrrGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCC 

CTCTTCCTCTTTGGCCAGCCCGACTTCTC^ 
CCTGCCAACCTGCAGCTGTGCCACGGCATCG^ 
TGCTGGGCCACGAGACCATGAAGGAGGTGCTGGAGCAG^ 
GGTCATGAAGCAGTGCCACCCGGACACCAAGAAGTTCCTCT 

GTCTGCCTCGATGACCTAGACGAGACCATCCAGCCATGCCACrcGCTCTGCGTG^AG 

GAAGGACCCXTGCGCCCCGGTCATGTCCGC^^ 

AGTGCGACCGTTTCCCCCAGKjACAACGACCTITGCATCCCCCT^ 

CCTCCTGCCAGCCACCGAGGAAGCTCGAAAGGTATGTGAAGCCTCCAAAAATA^ 

GATGATGACAA_CGAGATAATGGAAACGC^ 
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GCAAGACCATTTACAAGCTGAACGKjTGTGTCCGAAAGKK^ 

^GTCTCAAAGA^^^ 
?I^rr^TGGGACAGAAAC^^ 

cSagc?S 

GCCACAGGAGTGGATAGCTGTTTTCACCTAAAGGAAAAGCCCAC^ 
ATCTTGTAGAAATATTCAAACTAATAA^ 



Figure 82 

DNDDvffiTXCKl^FAIiCIKVKEITYIN^ 

QCTCEEMNDINAPYLVMQQK(^GELVmVKRWQKa5REFKiaSRSmKLQC 
Figure 83 

ACGGGGCCTGGGCGGSAGGGGCGGTGGCTGGAGCTCGGT 

TGGGGGAATITGATCCAAGGAAGCGGT^^^ 
^GTGTCCACTTGC^GCGGGGGAGGCGGAGACGCGGAGCG^ 

^gctgctgctgcgggccgggctgcttgccctggctgctctct 
cggggctcgggctc^agcctgtgagcccgtccgcatccc^ 

cScSctgS 

AGCTACACAGAAGACCTATTTCCCMAACAAYACAACTATCT^ 
AGTGATTCTAGCAATAGTGATTCCACTCAGAGTCAGAAGTCTGGGAGGAACTCGAACC 



ACTCGGTAAAAAAGTT. 



COXKK^GCACa^ACTAAATCCCGAAATACA^ 
TAAGACITACTTGCArrGCT<3GACTAGCA^ 

OTGAG^^^T^^"C^^^^CAT^^AAA^CTGTTCTrTTGCAATAATAATAAATT 
AAACATGCTGTTA 



Figure 84 
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MVCGSPGGMIIXRAGIXAI^ 
STQANAILAffiQFEGLmraCSPDIXFTI^^ 

IKYI^SWPE^CEEIPVYDRGVCISPEAWTADGADFPMDSSNGNC^GASSERCKCKPm 
ATQKTYFRNNYNYVKAK^^ 

PPLNV1^EYIMGYEDEER5RLLLVEGSIAEKWKDRLGKKVE31WD 
SDSTQSQKSGKNSNPRQARN 



Figure 85 

CAGCGGCCGCTGAATTCTAGGGCGGGTrGGCGCCCCGAAGGCTGAGAGCTGGCGCTGC 

TCGTGCCCTGTGTGCCAGACGGCGGAGCTCCGCGGCCGGACCCCGCGGCCCCGCrTTG 

CTGCCGACTGGAGTITGGGGGAAGAAACTCTCCTGCGCCCCAGAAGATTTCTTCCTCGG 

CGAAGGGACAGCGAAAGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCCGG 

GGTCGCAGCGCGAGAGGGCAGTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGT 

GGCTGCACCTGGCGCTGGGCGTGCGCGGCGCGCCCTGCGAGGCGGTGCGCATCCCTAT 

GTGCCGGCACATGCCCTGGAACATCACGCGGATGCCCAACCACCTGCACGACAGCACG 

CAGGAGAACGCCATCCTGGCCATCGAGCAGTACGAGGAGCTGGTGGACGTGAACTGC 

AGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCCATTTGCACCCTGGAGTT 

CGTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGGCAACGCGCGCGCGACGACTGC 

GAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCGACGAGC 

tgcctgtctatgaccgtggcgtgtgcatttcgcctgaagccatcgtcacggacctcccg 
gaggatgttaagtggatagacatcacaccagacatgatggtacaggaaaggcctcttg 
atgttgactgtaaaggcctaagccccgatcggtgcaagtgtaaaaaggtgaagccaac 

tttggcaacgtatctcagcaaaaactacagct^^ 

tgcagaggagtggctgcaatgaggtgagaacggtggtggatgtaaaagagatcttcaa 
gtcctcatcacccatccctcgaactcaagtcccgctcattacaaattcttcttgccagt 

GTCCACACATCCTGCCCCATCAAGATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGG 

ATGATGCTTCTTGAAAATTGCTTAGTTGAAAAATGGAGAGATCAGCTTAGTAAAAGAT 

CCATACAGTGGGAAGAGAGGCTGCAGGAACAGCGGAGAACAGTTCAGGACAAGAAGA 

AAACAGCCGGGCGCACCAGTCGTAGTAATCCCCCCAAACCAAAGGGAAAGCCTCCTGC 

TCCCAAACCAGCCAGTCCCAAGAAGAACATTAAAACTAGGAGTGCCCAGAAGAGAAC 

AAACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGACTTCCTTA 

CAGGATGAGGCTGGGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCCC 

TAACAACTCACTGCAGTGCTCTTCATAGACACATCTTGCAGCATTTTTCTTAAGGCTAT 

GCTTCAGTTTTTCTTTGTAAGCCATCACAAGCCATAGTGGTAGGTTTGCCCTTTG 

GAAGGTGAGTTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCAGAGTAACCTGTG 

TGCATACTCTAGAAGAGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGC 

ATTGTAAAATAAATGCCATATTTCAAACAAAACACG 

1TACCTTTTGATATCTGTTGTTGCAATGTTAGTGATGTTTTAAAATGTGATGAAAATATA 
ATGTTTTTAAGAAGGAACAGTAGTGGAATGAATGTTAAAAGATCITrATGTGTTTATGG 
TCTGCAGAAGGATTTTTGTGATGAAAGGGGATTTITTGAAAAATTAGAGAAG 
ATGGAAAATTATAATGTGTTTTTTTACCAAT^ 

TTAAAAACAAAAATAATAATAAAGAAAAATAAATAAAAAGGAGAGGCAGACAATGTC 

TGGATTCCTGTTTTTTGGTTACCTGATTTCCATGATCATGATGCTTCTTGTCAACACCCT 

CTTAAGCAGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATTA 

GTTGGCTAATGCTCAAGTATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTC 

CCAGGACATCCACCCTGAGAATAATTTGACAAGCTTAAAAATGGCCTTCATGTGAGTG 

CCAAATTTTGTTTTTCTTCATTTAAATATTTTG 

AAATATAAATGTACAGAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAATTACTTGA 
CAGTTGGGATACTTTAATCAGAAAAAAAGAACTTATTTGCAGCATTTTATCAA 

rr^^ a m i a nvn^^^ a ^ a a mnn^^ a r^n.C* A TTT A TTTT A A A A H APA A TTTTT* A TTClCmCTTTTGCT 
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AACACAGTAAGCATGTATTTTATAAGGCATTCAATAAATGCACAACGCCCAAAGGAAA 

TAAAATCCTATCTAATCCTACTCTCCACTACACAGAGGTAATCACTATTAGTATTTTGG 

CATATTATTCTCCAGGTGTTTGGTTATGCACTTATAAAATGATTTGAACAA 

AGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCCTTTATTGAGATA 

AGTTTTCCTGTCAAGAAAGCAGAAACCATCTCATTTCTAACAGCTGTGTTATATTCCAT 

AGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTrCA 

CAATACTGAATAAACATCTCACCGGAATTC 

Figure 86 ~ 

MFimVALCLWUEAIXjVRGAPC^^^ 

EELVDWCSAVLRFFLCAMYAPICTLEFLHDP 

SLAODELPV\T)RGVCISPEAIVTDLPEDVKWIDITPDMMYQE 

VEPTIATYI^KOTSYVIEL^KEKAVQRSGCNEV 

PHILPHQDVLIMCYEWRSRMMI^ 

AGRTSRSNPPKPKGKPPAPKPASPKKMKTRSAQKRTNPKRV 



Figure 87 . 

AAGCTTGATATCGAATTCGCGGCCGCGTCGACGKjGAGGCGCCAGGATCAGTCGGGGCA 
CCCGCAGCGCAGGCTGCCACCCACCTGGGCGACCTCCGCGGCGGCGGCGGCGGCGGCT 

gggtagagtcagggccgggggcgcacgccggaacacctgggcggccgggcaccgagc 

GTCGGGGGGCTGCGCGGCGCGACCCTGGAGAGGGCGCAGCCGATGCGGGCGGCGGCG 

gcggcggggggcgtgcggacggccgcgctggcgctgctgctgggggcgctgcactgg 

GCGCCGGCGCGCTGCGAGGAGTACGACTACTATGGCTGGCAGGCCGAGCCGCTGCACG 

GCCGCTCCTACTCCAAGCCGCCGCAGTGCCTTGACATCCCTGCCGACCTGCCGCTCTGC 

CACACGGTGGGCTACAAGCGCATGCGGCTGCCCAACCTGCTGGAGCACGAGAGCCTGG 

CCGAAGTGAAGCAGCAGGCGAGCAGCTGGCTGCCGCTGCTGGCCAAGCGCTGCCACTC 

GGATACGCAGGTOTCCrGTGCTCGCTCTTTGCGCCCGTCTGTCTCGACCGGCCCATCT 

ACCCGTGCCGCTCGCTGTGCGAGGCCGTGGGCGCCGGCTGCGCGCGGCTCATGGAGGC 

CTACGGCTTCCCCTGGCCTGAGATGCTGCACTGCCACAAGTTCCCCCTGGACAACGACC 

TCTGCATCGCGGTGCAGTTCGGGCACCTGCCCGCCACCGCGCCTCCAGTGACCAAGATC 

TGCGCCCAGTGTGAGATGGAGCACAGTGCTGACGGCCTCATGGAGCAGATGTGCTCCA 

GTGACTTTGTGGTCAAAATGCGCATCAAGGAGATCAAGATAGAGAATGGGGACCGGA 

AGCTGATTGGAGCCCAGAAAAAGAAGAAGCTGCTCAAGCCGGGCCCCCTGAAGCGCA 

AGGACACCAAGCGGCTGGTGCTGCACATGAAGAATGGCGCGGGCTGCCCCTGCCCACA 

GCTGGACAGCCTGGCGGGCAGCTTCCTGGTCATGGGCCGCAAAGTGGATGGACAGCTG 

CTGCTCATGGCCGTCTACCGCTGGGACAAGAAGAATAAGGAGATGAAGTTTGCAGTCA 

AATTCATGTTCTCCTACCCCrGCTCCCTCTACTACCCTTTCTTCTACGGGGCGGCAGAGC 

CCCACTGAAGGGCACTCCTCCTTGCCCTGCCAGCTGTGCCTTGCTTGCCCTCTGGCCCC 

GCCCCAACTTCCAGGCTGACCCGGCCCTACTGGAGGGTGTTTTCACGAATGTTGTTACT 

GGCACAAGGCCTAAGGGATGGGCACGGAGCCCAGGCTGTCCTTTTTGACCCAGGGGTC 

CTGGGGTCCCTGGGATGTTGGGCTTCCTCTCTCAGGAGCAGGGCTTCTTCATCTGGGTG 

AAGACCTCAGGGTCTCAGAAAGTAGGCAGGGGAGGAGAGGGTAAGGGAAAGGTGGAG 

GGGCTCAGGGCACCCTGAGGCGGAGGTTTCAGAGTAGAAGGTGATGTCAGCTCCAGCT 

CCCCTCTGTCGGTGGTGGGGCCTCACCTtGAAGAGGGAAGTCTCAATATTAGGCTAAG 

CTATTTGGGAAAGTTCTCCCCACCGCCCCTGTACGCGTCATCCTAGCCCCCCTTAGGAA 

AGGAGTTAGGGTCTCAGTGCCTCCAGCCACACCCCCTGCCTTCCCCAGCTTGCCCATTT 

CCCTGCCCCAAGGCCCAGAGCTCCCCCCAGACTGGAGAGCAAGCCCAGCCCAGCCTCG 

np ata a. a rnpprTTr^r^Trrr^rraTGr^TnGATTCCCGGGATTCATTCCTCAGCCTC 
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TOTTTCTCCCTTTTATCC 

CGACGCGCGGCCGCGGAATTCCTGCAGCCCGGGGGATCCCCGGGTACCGAGCTCGAAT 
TC 

Figure 88 
TEnJALCVLIHHTDVN^ 

AALRAVGM\nTGTDEQTQVVLNCDAI^HFPALLTHPKE 
AVIDANLWMIIHLIJDKGDFGTQKEAAWAISmTISGRKDQVAYLIQQNW 

AQWQWIJ^GLSNILKMAEDEAETIGNLffiECGGO^ 
DDIDEDPSLVPEAIQGGTFGFNSSANVPTEGFQF 



Figure 89 

ATGCATCTCCTCTTATTTCAGCTGCTGGTACT 

CCAGGATGGCCGCCAGAATCAGAGTTCTCTTTGCCCCGTACTCGT^ 

AGAGAGCTTCCCACAGGCAACCATGAGGAAGCTGAGGAGAAGCCAGATCTGTTTGTCG 

CAGTGCCACACCTTGTAGCCACCAGCCCTGCAGGGGAAGGCCAGAGGCAGAGAGAGA 

AGATGGTGTCCAGATTTGGCAGGTTGTGGAAGAAGCCTGAGAGAGAAATGCAIGC 

CAGGGACTCAGATAGTGAGCCCTTCCCACGrGGGACCCAGTCCCTCATCCAGCCGATA 

GATGGAATGAAAATGGAGAAATCTCGTCTTCGGGAAGAAGCCAAGAAATTCTG 

ACTTCATGTTCAGAAAAACTCCGGCTrCTCAGGGGGTCATCTTGGCCATCAAAAGC^ 

GAAGTACATTGGGAGACCTGCAGGACAGTGCCGrTCAGGGAGACTATAACCCACGAAG 

GCTGTGAAAAAGTAGTTGTTCAGAACAACCTTTGCTTTGGGAAATGCGGGTCTG 

TTTCCTGGAGCCGCGCAGCACTCCCATACCTCCTGCTCTCACTGTTTGCCTGCCAAG^C 

ACCACGATGCACTTGCCACTGAAGTGCACTGAACTITCCTCCGTGA^ 
GCTGGTGGAGGAGTGCCAGTGCAAGGTGAAGACGGAGCATGAAGATGGACACATCCT 

ACATGCTGGCTCCCAGGATTCCTTTATCCCAGGAGTTTCAGCTTGA 

Figure 90 
MHLLLFQLLVIJLPLGK^ 

HLVATSPAGEGQRQREKMLSRFGRFWKKPEPJEMHPSRDSDSEPFTPGT^ 
KSPIREEAKKFWHHFMFRKTPASQGVIIPIKSHEVH^ 

NLCFGKCGSVHFPGAAQHSHTSCSH(XPAKFTTMHLPLNCTEI^SVIKVVMLVEECQCKV 
KTEHEDGHILHAGSQDSFIPGVSA 



Figure 91 

CGGCACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACG^ 
CCCTCTTGAGTCCTTCTGAGATGATGGCTCTGGGCGCAGC 

GTCGCGATGGTAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCT 

TGAACTCGGTTCTCAATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCT 

GCGGGGCACCCAGGCTCTGGAGTCAGCGCCGCGGCGGGAATCCTGTACCCGGG^ 

ataagtaccagaccattgacaactaccackxgtacccgt^^ 

CGGCACTGATGAGTACTGCGGTAGTCCCA^ 
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GGAATTACTGCAAAAATGGAATATGTGTGTCITCTGATCAAAATCAmCCGAGGAGA 

AATTGAGGAAACCATCACTGAAAGCTITGGTAATGATCATAGCACCTTGGATGGGTAT 

TCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTG 

TTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCA 

AGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGG 

CTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGA 

TACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACAC 

TAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTT 

TTATGACCITCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCA 

TTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTrCTTTATG^ 

CCTGTGATTGCAGTAAATtACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAAT 

GCAATGAAACTTTTAATTATTTTTCT 

TGTAAATrTTTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGA 
AGTAAATCATTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAG 
AGTCTAGAACGCAAGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACAT 
GAAAATACTAGCrrATTTTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCm 

AGGCTGTGATAGTTTTTGAAATAAA 
GACAT 



Figure 92 

MMALGAAGATRVFVAMVAAALGGOTLLGVSATLNSVLNSNAIKNIJPPLGGAAGHPGSA 
VSAAPGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLAaOCRRK 

RCMRHAMC&GNYCKNGICVSSDQNHF^ 

TKGQEGSVCIJtSSDCASGLCCARHFWSKIC^ 

EGLSCRIQKDHHQASNSSRLHTCQPvH 



Figure 93 

GCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGG 

GTCCTGGGTCCCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGAC 

CCAAGTGAGGGGCCCCGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCAGCCCTCCGGGC 

rrTGCGTCTTCCTGGGGACCCCCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGG 

ATTCGTCCTGCTGCCTGCTCCTACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATC 

GGCAGTTCGCGGGCCAAACTCAACTCCATCAAGTCCTCTCTGGGCGGGGAGACGCCTG 

GTCAGGCCGCCAATCGATCTGCGGGCATGTACCAAGGACTGGCATTCGGCGGCAGTAA 

GAAGGGCAAAAACCTGGGGCAGGCCTACCCrTGTAGCAGTGATAAGGAGTGTGAAGTT 

GGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGGTGTGTCGGAGAA 

AAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGCTGCAATAATGG 

CATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTGACATCCCGGCTCTGGATGGTA 

CTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGAA 

TCTAGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGC 

CTACGATCATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAAT 

CTGCAAACCAGTGCTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCT 

CATGGGCTGGAAATTTTCCAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATG 

GAAAGATGCCACCTACTCCTCCAAAGCCAGACTCCATGTGTGTCAGAAAATrTGATCA 

CCATTGAGGAACATCATCAATTGCAGACTGTGAAGTTGTGTATTTAATGCATTATAGCA 

TGGTGGAAAATAAGGTTCAGATGCAGAAGAATGGCTAAAATAAGAAACGTGATAAGA 

ATATAGATGATCAC 
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. Figure 94 

tr^GICPVTES^^ 

DPCI*SSI*I^^ 
WKDATYSSKARLHVCQKI 

Figure 95 

CTATCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCAC 
CGAGAAATTCACAAGATAACCAACAACCAGACTGGACAAA^ 

TTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACG 
AGGACTGTGGCTCCCAGCATGTAC^^ 

ATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCAGCTG 

TGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGAC 

AGAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTT 

CCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGG 

CTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGGCTTGGACCGATGCCCTTG 

TGCCAGTGGCCTCCrCTGCCAGCCCCACAGCCACAGGCTGGTGTATGTGTGCAAGCCG 

ACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTCCCCG 

ATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGA 

CTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAAT 

agctaatttatttccccaggtgtgtgctttaggcgtgggctGaccaggcttcttcctac 

AGCTCCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGG 

GTTAAACTGCAGGAGCAGTTTGCCACCCCTG 
CAGTTGGCAGACAGCCGTTTGTTCTACATGC^^^ 

GGAAACAATGTGGAGTCTCCCTCTGATTGGrmGGGGAAATGTGGAGAAGA^^^ 
TGCmGCAAACATGAACCTGGCAAAAATGCAACAAATGAATTTTCCACGCAGTTCTTT 

GCATTACATGTGITI^^TCATCCAGCAGTO 

GrrCTCCTCGTCCATCAGGGATCTCAGAGGNCTCAGAGACTGCAAGCTGOT^ 

GTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAA 

TCTCTCCACTACCX^CACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGG 

AGAATGGGATTTTTCITTTGAGGCATGCACATCT 

CATCCCTCTAAAAGTAAACTACTGTTAGGAACAGCAGTG^CTC^ 

CCGTCCTTCTAATGAAGACAATGATATTGACACTGTCCCTC 

AACTTTGAAAGGTATATGACTGAGCGTAGCATACAGGTTAACCTGCAGAAACAGTACT 

TAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCAAAATCACTTAGCAGCAACT 

GAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGGTGTOT 

GGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCA 

GGTGTCATGGACTGTTGGCACCATGTATTCATCCAGAGTTCTTAA^ 

CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCAT^ 

TAGAAATCAAGCATAAATCAC 
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Figure 96. 

pSK^I^^RCTCASGI^QPHSHSLVYVCKPIWGSRDQDGBILIPREVP 
DEYEVGSFMEEVRQELEDLERSLTEEMALGEPAAAAAALLGOEEI 



Figure97 

AGACGACGTGCTGAGCTGCCA(XnTAGTGGAAG^ 

CGCTITGGTGACGCACAGTGCTGGGACCCTCCAGGA^ 

GCGGCCGTCCTGCTGGGGCTGAGCTGGCTCTGC^ 

rTTrAACAACATCAGGAGCTCTGCTGACCTGGATGGGGCCCGGAAGGGCTCACAGTGC 
rGTTCTGTGCTACATGTCGTGGGTTGCGGAGGAGGTGCCAGCGAGATGCCATGTGCTG 

TAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGCAGA^ 

CAGTCCAGGAAAACCAACCCAAAA<X3AAGCCAAGTATC^^ 

AGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACTCTGC^ 

tGctcgtcatttttggacgaaaatttgtaagccagtc 

CCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATGTTCCAG^ 
TGGCCCTGGACTACTGTGTCGAAGCCAAT^^ 

AGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACAT 
TGC 

Figure 98 

KDTAQAPEIFQRCDCGPGLLCRSQLTSNRQHARI^VCQKIEKL 



Figure 99 

AGGCAGAATACTTCTATGAATTCCTCTCCTTGCGCTCCC^ 

ArroT^TC^^CTGAAGGCAACACCATrCTCCAAACACCTCAAAATGClATC^Crr 
TAAAA^JCT^AACA^GCTGAGTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAAT 

GAAAG^GCA^CTGCGAGTGTCCTGATGGGTTCCACGGACCrc^ 
TTTfTrArrCCACGATGTATGAATGGTGGACTTTGTGTGACTCCTGGTTTCTGCATCTGCC 
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AGTGTGAAATCAGCAAATGCCX3AC 

AAGCAAATGTAAGTGTTCCAAAGGTTACCAGGGAGACCTCTGTTCAAAGCCT 
GAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACC^ 

aaggttcxkatggaagacactgcaataaaagg^^ 

GAGCGCAGCAGCGCCCAGCTCAGGCAGCACACGCCTTC^^ 

CGGGGGCATCCACCTGAATCCAATTACATCTGGTGAACTCCGACAT 

AGTTACACCAAGTTCATAGCCTITGTTAACCTTTCATGTGTOAAT^^ 

CATTACACTTAAGAATACTGGCCTGAATTTTATTA 

ATATTTACTCTTCCTTITAAGTm^ 

OTGTGTAGTTGGCAGATATITrCAAAATTACAATGCA™ 
GAACATCAGAAAGGTTAAATTGGGCAAAAfVTGCGI 

CAGTT^TGTTGAAGTTACAGCATTTCAGATTTTATTGTCAGATATTTAGATGTTTGTTA 

CATITITAAAAATTGCTCTTAATTTTTAAA 
TTATTCCAGAGATTCAGTATTAAAAAAAAAAA^^ 

ACAATATAATATATTCTAAACACAATGAAATAGGGAATATAATGTATGAACTTTTTGCA 

TTGGCTTGAAGCAATATAATATATTGTAAACAAAACACAGCTCTrACCTAATAAACA 

TTATACTGTTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTC 



Figure 100 

MARRS AFP AAALWLWSILL(XLALRAEAGPPQEESL^ 
MAPFTHDFRKAQQRMPAIPVNIHSMNFrWQAAGQAEYFTO 

LLGTVPHKASWQVGFPCI£rKQDGVAA^ 
GGCRNGGFCNERRICECPDGraGPHCE^ 

ANCSTTCFNGGTCFYPGKCICPPGLEGEQCEISKCPQPCKNGGKCIGKSKCKCSKGYQGDL 

CSKPVCEPGCGAHGTCHEPNKCQCQEGWHGRHCNKRYEASLIHALR 

PAGAQIJRQHTPSLKKAEERRDPPESNYIW 

Figure 101 

ATGGGCATCGGGCGCAGCGAGGGGGGCCGCCGCGGGGCAGCCCTC 

GCGCTGGGCGCGGCGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGACT^ 

TCCAGTCGGACATCGGCCCGTACCAGAGCGGGCGCTTCTACACCAAGCCACCTCAGTC 

CGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAACGTGGGC^ 
CTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCAGGCCAGCAGC 

TGGGTCGCCCT^ 

CTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTCTGCGAGGCCG 

TGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGCCCGAGATGCTT 

AAGTGTGACAAGTTCCCCGAGGGGGACGTCTGCATCGCCATGACGCCGCCCAATGCCA 

CCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGACAACGAOTGAA 

ATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGGGCTGAGTTTAAA^^TGA 

TTGTGGGTAGCTCCCATAACTCATGCTGCACGCTGGGTCCTTCTCATCCCAACTGCTCA 

AAGCGGCAGGAGCAGGAACTGGGGACTCCTGAGAGAAGGCTTGGATATGGCC^AT 

TACACTTCATCCAAGGAAATCTGCCCCCACCCTGTGCCCAGGCCCGATCACGCATGAG 

GCTAAAGACGGAGGCCACTCCOCTGGCTCTGGGTAGATCTGCCCCT^ 

GACTGCCCGGAGCGCCCTCTGCCGGTCTGCAGCTTCCCACACCACACGGAAGAAGTGG 

GGAAACTGAGGATACATTCTTTCCTCCTCCAGGTAAAGGGATTCTCAATGAAGGGCT 

TGTGCACCTTCCACACTTAGATACCTCTACHACCT^ 

TCAAGAGTACCAGGCACATAGTGCTCAAGTCTGGGCTAATATGCCACCTGCAGAGAGA 
. . .m^i «n* *p.Ari a a nor A TYVrTTTf 1 AAA CtTG A 
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Figure 102 

MGIG^EGGRRGAALGVIXAIXMALLAVGSASEYDYVSFQSDIGPYQSGRFY 
PADI*IX2IrWGYK12vf^ 

CIX>PJ»IYPC^WI^EAVPJDSCEPVMQFFGFYWPFJV1LKCDKFPEGDVC 
QGTIVCPPCDNELKSEAIffiHIX:ASEFGI^LKMlVGSSmSCCTLGPSHPNSSKRQEQELGTP 

EPJtfX5YGLLmHQGNLPPPCAQAPJS^^ 

HTEEVGKLRIHSFLLQVKGFSMKGLCAPSTLRYLYYIXTSMQHVHQEYQAHSAQWA^JM 
PPAERCKDEEDKAMFSK 



Figure 103 

GGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTGCCCTGTGTGCCAGAC 

GGCGGAGCTCCGCGGCCGGACCCCGCGGCCCCGCTTTGCTGCCGACTGGAGTTTGGGG 

GAAGAAACTCTCCTGCGCCCCAGAAGATTTCTTCCTCGGCGAAGGGACAGCGAAAGAT 

GAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCCGGGGTCGCAGCGCGAGAGGGC 

AGTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACCTGGCGCTGGG 

CGTGCGCGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGG 

AACATCACGCGGATGCCCAACCACCTGCACCACAGCACGCAGGAGAACGCCATCCTGG 

CCATCGAGCAGTACGAGGAGCTGGTGGACGTGAACTGCAGCGCGGTGCTGCGCTTCTT 

CTTCTGTGCCATGTACGCGCCCATTTGCACCCTGGAGTTCCTGCACGACCCTATCAAGC 

CGTGGAAGTCGGTGTGCCAACGCGCGCGCGACGACTGCGAGCCCCTCATGAAGATGTA 

CAACCACAGCTGGCCCGAAAGCCTGGCCTGCGACGAGCTGCCTGTCTATGACCGTGGC 

GTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCCGGAGGATGTTAGTGGATAGA 

GATCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGTTGACTGTAAACGCCTA 

AGCCCCGATCGGTGCAAGTGTAAAAAGGTGAAGCCAACTTTGGCAACGTATCTCAGCA 

AAAACTACAGCTATGTTATTCATGCCAAAATAAAAGCTGTGCAGAGGAGTGGCTGCAA 

TGAGGTCACAACGGTGGTGGATGTAAAAGAGATCTTCAAGTCCTCATCACCCATCCCTC 

GAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACATCCTGCCCCAT 

CAAGATGTTGTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTG 

CTTAGTTGAAAAATGGAGAGATCAGCTTAGTAAAAGATCCATACAGTGGGAAGAGAG 

GCTGCAGGAACAGCGGAGAACAGTTCAGGACAAGAAGAAAACAGCGGGGCGCACCAG 

TCGTAGTAATCCCCCCAAACCAAAGGGAAAGCCTCCTGCTCCCAAACCAGCCAGTCCC 

AAGAAGAACATTAAAACTAGGAGTGCCCAGAAGAGAACAAACCCGAAAAGAGTGTGA 

GCTAACTAGTTtCCAAAGCGGAGACTTCCGACTTCCTTACAGGATGAGGCTGGGCATTG 

CCTGGGACAGCGTATGTAAGGCCATGTGCCCCTTGCCCTAACAACTCACTGCAGTGCTC 

TTCATAGACACATCTTGCAGCATTTTTCTTAAGGCTATGCTTCAGTTTTTCTT^ 

CATCACAAGCCATAGTGGTAGGTTTGCCCTTTGGTACAGAAGGTGAGTTAAAGCTGGT 

GGAAAAGGCTTATTGCATTGCATTCAGAGTAACCTGTGTGCATACTCTAGAAGAGTAG 

GGAAAATAATGCTTGTTACAATTCGACCTAATATGTGCATTGTAAAATAAATGCCATAT 

TTCAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCnm 
GCAATGTTAGTGATGTTTTAAAATGTGATGAAAATATAATGTTTTTAAGAAGGAACAGT 

AGTGGAATGAATGTTAAAAGATCTTTATGTGTTTATGGTCTGCAGAAGGATTTTTG 
TGAAAGGGGATTTTTTGAAAAATTAGAGAAGTAGCATATGGAAAATTATAATGTGTTT 

TTTTACCAATGACHTCAGTTTCTGTTT1TA 

a a AfrA a A A A T AAATA AAAAGGAGAGGGAGACAATGTCTGGATTCCTGTTTTTTGGTTA 
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CCTGATTTCCATGATCATGATGCTTCTTGTCAACACCCTCITAAGCAGCACCAGAA^ 
GTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATTAGTTGGCTAATGCTCAAGT 
AiTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAGGACATCCACCCTGA 
GAATAATTTGACAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTGT^ 

attraaatattttctttgcctaaatacatgtgagaggagttaaatataaatgtacagag 
Aggaaagttgagttccacctctgaaatgagaattacttgacagttgggatactttaatc 

AGAAAAAAAGAACTTATTTGCAGCAlTrTATCAACAAATTTCATAATTGT^ 
GAGGCATTTATTTTAAAAAACAATTITATTGGCCTlT^ 

TTTATAAGGCATTCAATAAATGCACAACGCCCAAAGGAAATAAAATeCTATCTAATCC 

TACTCTCCACTACACAGAGGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGT 

TTGCTTATGCACTTATAAAATGATTTGAACAAATAAAACTAGGAACGTGTATACATGTG 

TTTCATA^CCTGCCTCCTITGCTrGGCCCTTTATTGAGATAAGTTTTC 

CAGAAACCATCTCATTTCTAACAGCTGTGTTATATTCCATAGTATGCATTACTCAACAA 

ACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGACAATACTGAATAAACATCT 

CACCGGAATTC 



Figure 104 
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LACDEIJ'VYDRGVCISPEAIVTDLPEDVKWIDITPDMMVQERPLDVDCKia.SPDRCKCKKV 

KPTLATYLSKJST^SYVIHAKIKAVQRSGCHEVTTV^^ 

fflLPHQDVLmCYEWRSRMMLLENCLVEKWI^QLSKRSIQWEEPXQEQRRTVQDKKKTA 
GRTSRSNPPKPKGKPPAPKPASPKKNIKTRSAQKRTNPKRV 
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